Method for the isolation of novel antigens

ABSTRACT

Methods for isolating novel antigens that stimulate CD4-positive T cells are disclosed. The methods provided comprise the steps of: transforming a host cell with a plasmid suspected of containing such a DNA sequence encoding such an antigen and inducing antigen expression; incubating the transformed host cell with at least one dendritic cell for a period of time sufficient to form a peptide/MHC Class II complex on the dendritic cell, the peptide being derived from the expressed antigen; incubating the dendritic cell with CD4+ T cells; determining the level of CD4+ T cell stimulation, thereby identifying a transformed host cell that expresses at least one CD4+ T cell-stimulating antigen; and isolating DNA from the transformed host cell. DNA sequences isolated using such methods, together with antigens encoded by such DNA sequences, are also provided.

TECHNICAL FIELD

[0001] The present invention relates generally to methods for the isolation of novel antigens. The invention is more particularly related to methods for isolating antigens that stimulate T cells.

BACKGROUND OF THE INVENTION

[0002] Both CD4+ and CD8+ T cells play a key role in the body's ability to mount an effective immune response to many disorders. For example, CD4+ T cells have been shown to be important in immunity to bacterial pathogens and also in mediating autoimmunity. While both CD4+ and CD8+ T cells recognize peptide/major histocompatability (MHC) complexes expressed on antigen-presenting cells wherein the peptide is derived from an antigen associated with a particular disorder, CD4+ T cells recognize peptide/MHC Class II complexes and CD8+ cells recognize peptide/MHC Class I complexes. The identification of expressed peptide/MHC Class I complexes has led to the isolation of several novel antigens that stimulate CD8+ T cells. However, the isolation of CD4+ T cell-stimulating antigens has been more problematic due, in part, to the large amounts of purified protein with which antigen-presenting cells must be incubated in order to obtain detectable CD4+ T cell responses, the small amounts of peptide/MHC II complex expressed by the antigen-presenting cells and the heterogeneous size of such complexes.

[0003] Sanderson et al. (J. Exp. Med. 182:1751-1757, 1995) describe a method for the isolation of antigens that stimulate CD4+ T cells wherein they first constructed a T cell hybrid containing a reporter gene by immunizing mice with Listeria monocytogenes (LM) to provide LM-specific CD4+ T cells which were subsequently used to form lacZ-inducible CD4+ T cell hybrids. An LM genomic DNA expression library was transformed into E. coli and, following antigen induction, the transformed E. coli incubated with syngeneic peritoneal macrophages to generate peptide/MHC class II complexes. The presence of peptide/MHC II complexes was detected by probing the macrophages with the lacZ-inducible LM-specific CD4+ T cell hybrids and staining with X-Gal substrate to visualize activated lacZ-positive T cells. The method of Sanderson et al. is technically complex and involves many steps, thereby being time-consuming and unsuitable for high-throughput use.

[0004] Accordingly, there is a need in the art for improved methods for isolating novel antigens that stimulate CD4+ T cells. The present invention fulfills these needs and further provides other related advantages.

SUMMARY OF THE INVENTION

[0005] Briefly stated, this invention provides methods for identifying and isolating novel DNA sequences that encode CD4+ T cell-stimulating antigens, such methods being less technically difficult and less time-consuming than prior art methods, and more suitable for high-throughput use. In one aspect, such methods comprise the steps of: transforming a host cell with a plasmid suspected of containing such a DNA sequence and inducing antigen expression; incubating the transformed host cell with at least one dendritic cell for a period of time sufficient to form a peptide/MHC Class II complex on the dendritic cell, the peptide being derived from the expressed antigen; incubating the dendritic cell with CD4+ T cells and determining the level of CD4+ T cell stimulation, thereby identifying a transformed host cell that expresses at least one CD4+ T cell-stimulating antigen. In another aspect, the inventive methods further comprise isolating DNA from the transformed host cell.

[0006] The level of CD4+ T cell stimulation may be determined by measuring either cell proliferation or production of a cytokine, such as interferon-γ (IFN-γ). The inventive methods may be usefully employed to isolate novel antigens associated with, for example, infectious disease agents, tumor tissue and autoimmune disorders.

[0007] In one embodiment, the host cell employed in the inventive methods is selected from the group consisting of E. coli, yeast and mammalian cells. Preferably, a cDNA or genomic DNA library is first prepared from the agent or tissue of interest to provide the plasmids employed to transform the host cell.

[0008] In another aspect, methods for identifying novel DNA sequences that encode antigens comprising antibody epitopes are provided.

[0009] In a further aspect, the present invention provides DNA sequences isolated using the inventive methods, together with sequences that are complementary to such DNA sequences and sequences that hybridize thereto under conditions of moderate stringency. Expression vectors comprising such DNA sequences and host cells transformed therewith are also provided. In yet another aspect, the present invention provides polypeptides comprising amino acid sequences encoded by such DNA sequences. Such polypeptides may be usefully employed in the diagnosis and treatment of disorders such as infectious diseases, cancers and autoimmune disorders.

[0010] These and other aspects of the present invention will become apparent upon reference to the following detailed description and attached drawings. All references disclosed herein are hereby incorporated by reference in their entirety as if each was incorporated individually.

BRIEF DESCRIPTION OF THE DRAWINGS

[0011]FIGS. 1A and 1B illustrate the presentation of E. coli expressed Tb38-1 by macrophages and dendritic cells, respectively, as measured by production of interferon-γ by an autologous Tb38-1-specific CD4+ T cell clone.

[0012]FIGS. 2A and 2B illustrate the presentation of E. coli expressed Tb38-1 by macrophages and dendritic cells, respectively, as measured by stimulation of T cell proliferation by an autologous Tb38-1-specific CD4+ T cell clone.

[0013]FIG. 3 illustrates stimulation of cell proliferation in three different CD4+ T cell lines by dendritic cells incubated with a single E coli colony expressing Tb38-1 mixed with either 0, 14 or 26 E. coli colonies expressing irrelevant genes.

[0014]FIGS. 4A and 4B illustrate the stimulation of proliferation and interferon-γ production, respectively, in T cells derived from a first PPD-positive donor (referred to as D7) by recombinant ORF-2 and synthetic peptides to ORF-2.

[0015]FIG. 5A and 5B illustrate the stimulation of proliferation and interferon-γ production, respectively, in T cells derived from a second PPD-positive donor (referred to as D160) by recombinant ORF-2 and synthetic peptides to ORF-2.

DETAILED DESCRIPTION OF THE INVENTION

[0016] As noted above, the present invention is generally directed to methods for isolating DNA sequences that encode CD4+ T cell-stimulating antigens. As used herein the term “CD4+ T cell-stimulating antigens” refers to antigens that are capable of stimulating proliferation and/or cytokine production (for example, IL-2, GM-CSF, or IFN-γ) in T cells known to be CD4-positive. The inventive methods may be employed to isolate antigens associated with any disorder in which the stimulation of CD4+ T cells is believed to play a role in the body's immune response. For example, the methods may be used to isolate CD4+ T cell-stimulating antigens associated with infectious disease agents (such as Mycobacterium tuberculosis and Leishmaniasis), tumor tissue and autoimmune disorders.

[0017] A genomic or cDNA expression library is first prepared from the agent of interest, such as cultured M. tuberculosis or tumor tissue, using methods well known in the art (see, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, N.Y., 1989). The resulting plasmids are transformed into a host cell and protein expression is induced, for example, with IPTG. In one embodiment, the host cell is selected from the group consisting of E. coli, yeast and mammalian cells. The transformed host cells are then incubated with dendritic cells for a period of time sufficient to form peptide/MHC Class II complexes. In general, the cells may be incubated for about 60 to about 90 minutes at a temperature of about 37° C. The dendritic cells employed in the inventive methods are preferably immature, monocyte-derived dendritic cells, as described in detail in Example 1. These cells have been shown to phagocytose particles including bacteria, such as E. coli.

[0018] The presence of peptide/MHC Class II complex is determined by measuring the level of CD4+ T cell stimulation. Preferably, the antigen-presenting dendritic cells are incubated with CD4+ T cells specific for the antigen of interest for periods of about 3 days at about 37° C. Such CD4+ T cells are generated by stimulation of T cells with antigen (for example, in the form of lysate or secreted proteins from an infectious agent, whole pathogen or tumor tissue) in the presence of antigen-presenting cells. CD4+ T cell lines of the desired specificity may be cloned using limiting dilution techniques well known in the art.

[0019] The level of T cell stimulation is determined by measuring cell proliferation and/or cytokine (for example, IL-2, GM-CSF or IFN-γ) production and comparing the level of stimulation to that obtained with control dendritic cells (i.e. dendritic cells presenting irrelevant antigens). As described in detail below, cell proliferation and cytokine production may be evaluated by methods well known in the art. For example cell proliferation may be determined by exposing cells to a pulse of radiolabeled thymidine and measuring the incorporation of label into cellular DNA. Cytokine production may be evaluated in an enzyme-linked immunosorbent assay (ELISA), wherein wells are scored positive if the readout is greater than 3 standard deviations above the mean of the control dendritic cells.

[0020] A positive readout indicates that the transformed host cell is expressing a CD4+ T cell stimulating antigen. DNA encoding the antigen may then be isolated from the transformed host cell using techniques well known in the art, such as those taught by Sambrook et al., Ibid.

[0021] The inventive methods may additionally be employed to identify antigens comprising antibody epitopes. The antibody response to protein antigens requires help from CD4+ T cells. These cells respond to specific antigens and produce cytokines and co-stimulatory molecules that stimulate antibody production by B cells. Such antibody production cannot occur in the absence of a CD4+ T cell response. Therefore, screening of antigens for CD4+ T cell recognition will yield proteins capable of stimulating helper T cell responses resulting in antibody production. Such protein antigens will be the target of an antibody response and are therefore good candidate serodiagnostic antigens. As described in detail below, the inventive methods may be used to isolate the M. tuberculosis antigen Tb38-1 which has been previously shown to react with sera from tuberculosis-infected patients.

[0022] The present invention also provides isolated DNA sequences selected from the group consisting of: a) sequences obtained using the inventive methods; b) sequence complementary to such sequences; and c) sequences that hybridize to the sequences of a) or b) under conditions of moderate stringency. Suitable moderately stringent conditions include prewashing in a solution of 5×SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); hybridizing at 50° C.-65° C., 5×SSC, overnight or, in the case of cross-species homology at 45° C., 0.5×SSC; followed by washing twice at 65° C. for 20 minutes with each of 2×, 0.5× and 0.2×SSC containing 0.1% SDS.

[0023] Expression vectors comprising such sequences and host cells transformed with such vectors are also provided. Suitable host cells include prokaryotes, yeast and higher eukaryotic cells. Preferably, the host cells employed are E. coli, yeast or a mammalian cell line such as COS or CHO. The DNA sequences expressed in this manner may encode naturally occurring antigens, portions of naturally occurring antigens, or other variants thereof.

[0024] The present invention further provides polypeptides comprising an immunogenic portion of an antigen, or a variant of such an antigen that differs only in conservative substitutions and/or modifications, wherein the antigen comprises an amino acid sequence encoded by one of the above DNA sequences. The antigens (and immunogenic portions thereof) isolated using the methods described herein have the ability to induce an immunogenic response. More specifically, the antigens have the ability to induce proliferation and/or cytokine production (for example, IFN-γ and/or IL-2 production) in CD4+ T cells.

[0025] As used herein, the term “polypeptide” encompasses amino acid chains of any length, including full length proteins (i.e., antigens), wherein the amino acid residues are linked by covalent peptide bonds. Thus, a polypeptide comprising an immunogenic portion of one of the above antigens may consist entirely of the immunogenic portion, or may contain additional sequences. The additional sequences may be derived from the native antigen or may be heterologous, and such sequences may (but need not) be immunogenic.

[0026] “Immunogenic,” as used herein, refers to the ability to elicit an immune response (e.g., cellular) in a patient, such as a human, and/or in a biological sample. In particular, antigens that are immunogenic (and immunogenic portions or other variants of such antigens) are capable of stimulating cell proliferation and/or cytokine production comprising at least an immunogenic portion of one or more antigens may generally be used in the detection of or induction of protective immunity against the disorder with which the antigen is associated.

[0027] A “variant,” as used herein, is a polypeptide that differs from the native antigen only in conservative substitutions and/or modifications, such that the ability of the polypeptide to induce an immune response is retained. Such variants may generally be identified by modifying one of the above polypeptide sequences, and evaluating the immunogenic properties of the modified polypeptide using, for example, the representative procedures described herein.

[0028] A “conservative substitution” is one in which an amino acid is substituted for another amino acid that has similar properties, such that one skilled in the art of peptide chemistry would expect the secondary structure and hydropathic nature of the polypeptide to be substantially unchanged. In general, the following groups of amino acids represent conservative changes: (1) ala, pro, gly, glu, asp, gln, asn, ser, thr; (2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his.

[0029] Variants may also (or alternatively) be modified by, for example, the deletion or addition of amino acids that have minimal influence on the immunogenic properties, secondary structure and hydropathic nature of the polypeptide. For example, a polypeptide may be conjugated to a signal (or leader) sequence at the N-terminal end of the protein which co-translationally or post-translationally directs transfer of the protein. The polypeptide may also be conjugated to a linker or other sequence for ease of synthesis, purification or identification of the polypeptide (e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc domain.

[0030] Immunogenic portions of the antigens described herein may be prepared and identified using well known techniques, such as those summarized in Paul, Fundamental Immunology, 3d ed., Raven Press, 1993, pp. 243-247 and references cited therein. Such techniques include screening polypeptide portions of the native antigen for immunogenic properties. The representative proliferation and cytokine production assays described herein may generally be employed in these screens. An immunogenic portion of a polypeptide is a portion that, within such representative assays, generates an immune response (e.g., proliferation, interferon-γ production and/or interleukin-2 production) that is substantially similar to that generated by the full length antigen. In other words, an immunogenic portion of an antigen may generate at least about 20%, and preferably about 100%, of the cell proliferation induced by the full length antigen in the model proliferation assay described herein. An immunogenic portion may also, or alternatively, stimulate the production of at least about 20%, and preferably about 100%, of the interferon-γ and/or interleukin-2 induced by the full length antigen in the model assay described herein.

[0031] Portions and other variants of M. tuberculosis antigens may be generated by synthetic or recombinant means. Synthetic polypeptides having fewer than about 100 amino acids, and generally fewer than about 50 amino acids, may be generated using techniques well known to those of ordinary skill in the art. For example, such polypeptides may be synthesized using any of the commercially available solid-phase techniques, such as the Merrifield solid-phase synthesis method, where amino acids are sequentially added to a growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 85:2149-2146, 1963. Equipment for automated synthesis of polypeptides is commercially available from suppliers such as Perkin Elmer/Applied BioSystems Division, Foster City, Calif., and may be operated according to the manufacturer's instructions. Variants of a native antigen may generally be prepared using standard mutagenesis techniques, such as oligonucleotide-directed site-specific mutagenesis. Sections of the DNA sequence may also be removed using standard techniques to permit preparation of truncated polypeptides.

[0032] Recombinant polypeptides containing portions and/or variants of a native antigen may be readily prepared from DNA sequences isolated employing the inventive methods, using a variety of techniques well known to those of ordinary skill in the art. For example, supernatants from suitable host/vector systems which secrete recombinant protein into culture media may be first concentrated using a commercially available filter. Following concentration, the concentrate may be applied to a suitable purification matrix such as an affinity matrix or an ion exchange resin. Finally, one or more reverse phase HPLC steps can be employed to further purify a recombinant protein.

[0033] In general, regardless of the method of preparation, the polypeptides disclosed herein are prepared in substantially pure form. Preferably, the polypeptides are at least about 80% pure, more preferably at least about 90% pure and most preferably at least about 99% pure. In certain preferred embodiments, described in detail below, the substantially pure polypeptides are incorporated into pharmaceutical compositions or vaccines for use in one or more of the methods disclosed herein.

[0034] The isolated DNA sequence obtained using the inventive methods (or a polypeptide comprising an amino acid sequence encoded by such a DNA sequence) may be employed in a patient to induce protective immunity against the disease agent from which the DNA sequence was isolated. As used herein, a “patient” refers to any warm-blooded animal, preferably a human. A patient may be afflicted with a disorder, or may be free of detectable disease and/or infection. In other words, protective immunity may be induced to prevent or treat a disorder.

[0035] In this aspect, the polypeptide is generally present within a pharmaceutical composition and/or a vaccine. Pharmaceutical compositions may comprise one or more polypeptides, and a physiologically acceptable carrier. Vaccines may comprise one or more of the above polypeptides and a non-specific immune response enhancer, such as an adjuvant or a liposome (into which the DNA molecule/polypeptide is incorporated). Such pharmaceutical compositions and vaccines may also contain other antigens, either incorporated into a combination polypeptide or present within a separate polypeptide.

[0036] Alternatively, a vaccine or pharmaceutical composition may contain DNA encoding one or more polypeptides as described above, such that the polypeptide is generated in situ. In such vaccines, the DNA may be present within any of a variety of delivery systems known to those of ordinary skill in the art, including nucleic acid expression systems, bacterial and viral expression systems. Appropriate nucleic acid expression systems contain the necessary DNA sequences for expression in the patient (such as a suitable promoter and terminating signal). Bacterial delivery systems involve the administration of a bacterium (such as Bacillus-Calmette-Guerrin) that expresses an immunogenic portion of the polypeptide on its cell surface. In a preferred embodiment, the DNA may be introduced using a viral expression system (e.g., vaccinia or other pox virus, retrovirus, or adenovirus), which may involve the use of a non-pathogenic (defective), replication competent virus. Techniques for incorporating DNA into such expression systems are well known to those of ordinary skill in the art. The DNA may also be “naked,” as described, for example, in Ulmer et al., Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691-1692, 1993. The uptake of naked DNA may be increased by coating the DNA onto biodegradable beads, which are efficiently transported into the cells.

[0037] Routes and frequency of administration, as well as dosage, will vary from individual to individual. In general, the pharmaceutical compositions and vaccines may be administered by injection (e.g., intracutaneous, intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Between 1 and 3 doses may be administered for a 1-36 week period. Preferably, 3 doses are administered, at intervals of 3-4 months, and booster vaccinations may be given periodically thereafter. Alternate protocols may be appropriate for individual patients. A suitable dose is an amount of polypeptide or DNA that, when administered as described above, is capable of raising an immune response in an immunized patient sufficient to protect the patient from the disorder for at least 1-2 years. In general, the amount of polypeptide present in a dose (or produced in situ by the DNA in a dose) ranges from about 1 pg to about 100 mg per kg of host, typically from about 10 pg to about 1 mg, and preferably from about 100 pg to about 1 μg. Suitable dose sizes will vary with the size of the patient, but will typically range from about 0.1 mL to about 5 mL.

[0038] While any suitable carrier known to those of ordinary skill in the art may be employed in such pharmaceutical compositions, the type of carrier will vary depending on the mode of administration. For parenteral administration, such as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, lipids, a wax or a buffer. For oral administration, any of the above carriers or a solid carrier, such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose, sucrose, and magnesium carbonate, may be employed. Biodegradable microspheres (e.g., polylactic galactide) may also be employed as carriers for the pharmaceutical compositions of this invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. Pat. Nos. 4,897,268 and 5,075,109.

[0039] Any of a variety of adjuvants may be employed in the vaccines of this invention to nonspecifically enhance the immune response. Suitable adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant and Freund's Complete Adjuvant (Difco Laboratories) and Merck Adjuvant 65 (Merck and Company, Inc., Rahway, N.J.). Other suitable adjuvants include alum, biodegradable microspheres, monophosphoryl lipid A and quil A.

[0040] As noted above, polypeptides comprising an immunogenic portion of a CD4+ T cell stimulating antigen isolated using the methods of the present invention may also be employed in the diagnosis of disease, using techniques well known in the art. For example, polypeptides containing an immunogenic portion of a CD4+ T cell stimulating antigen isolated from an M. tuberculosis DNA expression library, as described in detail below, may be employed for the diagnosis of tuberculosis infection in a biological sample, using, for example, an ELISA technique. As used herein, a “biological sample” is any antibody-containing sample obtained from a patient. Preferably, the sample is whole blood, sputum, serum, plasma, saliva, cerebrospinal fluid or urine. More preferably, the sample is a blood, serum or plasma sample obtained from a patient or a blood supply.

[0041] The polypeptide(s) are used in an assay, to determine the presence or absence of antibodies to the polypeptide(s) in the sample, relative to a predetermined cut-off value. The presence of such antibodies indicates previous sensitization to the antigens which may be indicative of infection.

[0042] There are a variety of assay formats known to those of ordinary skill in the art for using one or more polypeptides to detect antibodies in a sample. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988, which is incorporated herein by reference. In a preferred embodiment, the assay involves the use of polypeptide immobilized on a solid support to bind to and remove the antibody from the sample. The bound antibody may then be detected using a detection reagent that contains a reporter group. Suitable detection reagents include antibodies that bind to the antibody/polypeptide complex and free polypeptide labeled with a reporter group (e.g., in a semi-competitive assay). Alternatively, a competitive assay may be utilized, in which an antibody that binds to the polypeptide is labeled with a reporter group and allowed to bind to the immobilized antigen after incubation of the antigen with the sample. The extent to which components of the sample inhibit the binding of the labeled antibody to the polypeptide is indicative of the reactivity of the sample with the immobilized polypeptide.

[0043] In certain embodiments, the assay is an enzyme linked immunosorbent assay (ELISA). This assay may be performed by first contacting a polypeptide that has been immobilized on a solid support, commonly the well of a microtiter plate, with the sample, such that antibodies to the polypeptide within the sample are allowed to bind to the immobilized polypeptide. Unbound sample is then removed from the immobilized polypeptide and a detection reagent capable of binding to the immobilized antibody-polypeptide complex is added. The amount of detection reagent that remains bound to the solid support is then determined using a method appropriate for the specific detection reagent. An appropriate detection reagent is any compound that binds to the immobilized antibody-polypeptide complex and that can be detected by any of a variety of means known to those in the art. Preferably, the detection reagent contains a binding agent (such as, for example, Protein A, Protein G, immunoglobulin, lectin or free antigen) conjugated to a reporter group. Preferred reporter groups include enzymes (such as horseradish peroxidase), substrates, cofactors, inhibitors, dyes, radionuclides, luminescent groups, fluorescent groups and biotin. The conjugation of binding agent to reporter group may be achieved using standard methods known to those of ordinary skill in the art. Common binding agents may also be purchased conjugated to a variety of reporter groups from many commercial sources (e.g., Zymed Laboratories, San Francisco, Calif., and Pierce, Rockford, Ill.).

[0044] The detection reagent is incubated with the immobilized antibody-polypeptide complex for an amount of time sufficient to detect the bound antibody. Unbound detection reagent is then removed and bound detection reagent is detected using the reporter group. The method employed for detecting the reporter group depends upon the nature of the reporter group. For radioactive groups, scintillation counting or autoradiographic methods are generally appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups and fluorescent groups. Biotin may be detected using avidin, coupled to a different reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme reporter groups may generally be detected by the addition of substrate (generally for a specific period of time), followed by spectroscopic or other analysis of the reaction products.

[0045] To determine the presence or absence of anti-polypeptide antibodies in the sample, the signal detected from the reporter group that remains bound to the solid support is generally compared to a signal that corresponds to a predetermined cut-off value.

[0046] The following Examples are offered by way of illustration and not by way of limitation.

EXAMPLE 1 Presentation of a Known CD4+ T Cell Antigen

[0047] The ability of the inventive method to identify CD4+ T cell-stimulating antigens was demonstrated as follows.

[0048] The known M. tuberculosis antigen Tb38-1, described in U. S. patent application Ser. No. 08/533,634, (SEQ ID NO:67) was inserted into a modified pBSK (pBluescript) vector (Stratagene, La Jolla, Calif.) having a six histide tag and an enterokinase site inserted in the polylinker, and transformed into E. coli. As a control, E. coli were transformed with empty vector. Protein expression was induced by the addition of isopropyl-β-D-thiogalactopyranoside (IPTG). The transformed E. coli were then incubated with either immature monocyte-derived dendritic cells or with adherent macrophages at 37° C. for between 60 to 90 minutes. Dendritic cells were prepared by culturing adherent PBMC for one week in GM-CSF and IL-4. The presence of peptide/MHC Class II complexes was determined by measuring proliferation and production of IFN-γ in a CD4+ T cell clone (referred to as 4E4) which was generated by limiting dilution analysis with a synthetic peptide from Tb38-1 using cells from a PPD-positive donor known to react with recombinant Tb38-1.

[0049] IFN-γ production was measured using an enzyme-linked immunosorbent assay (ELISA). ELISA plates were coated with a mouse monoclonal antibody directed to human IFN-γ (Chemicon, Temecula, Calif.) in PBS for four hours at room temperature. Wells were then blocked with PBS containing 5% (W/V) non-fat dried milk for 1 hour at room temperature. The plates were washed six times in PBS/0.2% TWEEN-20 and [samples] diluted 1:2 in culture medium, and the ELISA plates were incubated overnight at room temperature. The plates were again washed and a polyclonal rabbit anti-human IFN-γ serum diluted 1:4000 in PBS/10% normal goat serum was added to each well. The plates were then incubated for two hours at room temperature, washed and horseradish peroxidase-coupled anti-rabbit IgG (Jackson Laboratories, West Grove, Pa.) was added at a 1:4000 dilution in PBS/5% non-fat dried milk. After a further two hour incubation at room temperature, the plates were washed and substrate added. The reaction was stopped after 20 min with 1 N sulfuric acid. Optical density was determined at 450 nm using 570 nm as a reference wavelength.

[0050]FIGS. 1A and B illustrate the presentation of E. coli-expressed Tb38-1 by macrophages and dendritic cells, respectively, to the TB38-1 reactive T cell clone 4E4, as measured by IFN-γ production. The induction of IFN-γ production by Tb38-1 transformed E. coli is compared with that for control E. coli (transformed with empty vector); Tb38-1 transformed E. coli diluted 1:10 with control E. coli (10%); Tb38-1 transformed E. coli diluted 1:50 with control E. coli (2%), and recombinant soluble Tb38-1 (1 μg/ml). Cultures were performed in duplicate in 96-well flat-bottom plates in a volume of 200 μl. Medium was RPMI/10% pooled human serum plus gentamicin.

[0051]FIGS. 2A and B show data similar to FIGS. 1A and B, except that instead of production of interferon-γ, cell proliferation was measured. Specifically, FIGS. 2A and 2B illustrate the presentation of E. coli-expressed Tb38-1 by macrophages and dendritic cells, respectively, as measured by CD4+ T cell proliferation. The stimulation of cell proliferation by E. coli-expressed Tb38-1 is compared with that for: control E. coil (transformed with empty vector); Tb38-1 transformed E. coli diluted 1:10 with control E. coli (10%); and Tb38-1 transformed E. coli diluted 1:50 with control E. coli (2%). After three days of culture in 96-well round-bottom plates in a volume of 200 μl, the plates were pulsed with 1 μCi/well of tritiated thymidine for a further 18 hours, harvested and tritium uptake determined using a gas scintillation counter.

[0052] The ability of the inventive method to identify a CD4+ T cell-stimulating antigen when expressed by one of a multitude of E. coli colonies was demonstrated as follows. Two CD4+ T cell lines, referred to as DC-4 and DC-5, were generated against dendritic cells infected with M. tuberculosis. Specifically, dendritic cells were prepared from adherent PBMC from a single donor as described above and subsequently infected with tuberculosis. Lynphocytes from the same donor were cultured under limiting dilution conditions with the infected dendritic cells to generate the T cell lines DC-4 and DC-5. These cell lines were shown to react with crude soluble proteins from M. tuberculosis but not with Tb38-1. Limiting dilution conditions were employed to obtain a third CD4+ T cell line, referred to as DC-6, which was shown to react with both crude soluble proteins and Tb38-1.

[0053] A single E. coli colony expressing the M. tuberculosis antigen Tb38-1 was mixed with either 0, 14 or 26 E. coli colonies expressing irrelevant antigens. These pools were grown overnight and then induced. The resulting pools were incubated with dendritic cells as described above and the presentation of peptide/MHC Class II complex was detected by measuring cell proliferation in the DC-4, DC-5 and DC-6 cell lines. The results are shown in FIG. 3, wherein CSP refers to culture supernatant M. tuberculosis proteins. Using the Tb38-1 specific T cell line, DC-6, the presence of E. coli-expressed Tb38-1 was detected in each of the E. coli pools. The T cell lines DC-4 and DC-5 did not react with E. coli-expressed Tb38-1.

[0054] Based upon the data shown in FIGS. 1-3, it was determined that a pool size of between about 30 and about 80, preferably about 50, is acceptable with moderate level expression of target antigen. However, use of a different expression vector may allow an increase in pool size.

EXAMPLE 2 Purification and Characterization of Novel CD4+ Antigens from M. Tuberculosis

[0055] Genomic DNA was isolated from the M. tuberculosis strains H37Rv and Erdman and used to construct expression libraries in the vector pBSK(−)using the Lambda ZAP expression system (Stratagene, La Jolla, Calif.). These libraries were transformed into E. coli, pools of induced E. coli cultures were incubated with dendritic cells, and the ability of the resulting incubated dendritic cells to stimulate cell proliferation and IFN-γ production in the CD4+ T cell line DC-6 was examined as described above. Positive pools were fractionated and re-tested until pure M. tuberculosis clones were obtained.

[0056] Twenty-three clones were isolated, of which nine were found to contain the previously identified M. tuberculosis antigens TbH-9 and Tb38-1, disclosed in U.S. patent application Ser. No. 08/533,634. The determined cDNA sequences for the remaining twelve clones (hereinafter referred to as Tb224, Tb636, Tb424, Tb436, Tb398, Tb508, Tb441, Tb475, Tb488, Tb465, Tb431 and Tb472) are provided in SEQ ID No: 1-12, respectively. The corresponding predicted amino acid sequences for Tb224, Tb636 and Tb431 are provided in SEQ ID NO: 13-15, respectively. These three antigens were found to show some homology to TbH-9, described above. Tb224 and Tb636 were found to be overlapping clones.

[0057] Tb424, Tb436, Tb398, Tb508, Tb441, Tb475, Tb488 and Tb465 were each found to contain two small open reading frames (referred to as ORF-1 and ORF-2) or truncated forms thereof, with minor variations in ORF-1 and ORF-2 being found for each clone. The predicted amino acid sequences of ORF-1 and ORF-2 for Tb424, Tb436, Tb398, Tb508, Tb441, Tb475, Tb488 and Tb465 are provided in SEQ ID NO: 16 and 17, 18 and 19, 20 and 21, 22 and 23, 24 and 25, 26 and 27, 28 and 29, and 30 and 31, respectively. In addition, clones Tb424 and Tb436 were found to contain a third apparent open reading frame, referred to as ORF-U. The predicted amino acid sequences of ORF-U for Tb424 and Tb436 are provided in SEQ ID NO: 32 and 33, respectively. Tb424 and Tb436 were found to be either overlapping clones or recently duplicated/transposed copies. Similarly Tb398, Tb508 and Tb465 were found to be either overlapping clones or recently duplicated/transposed copies, as were Tb475 and Tb488.

[0058] These sequences were compared with known sequences in the gene bank using the BLASTN system. No homologies to the antigens Tb224, Tb431 and Tb472 were found. Tb636 was found to be 100% identical to a cosmid previously identified in M. tuberculosis. Similarly, Tb508, Tb488, Tb398, Tb424, Tb436, Tb441, Tb465 and Tb475 were found to show homology to known M. tuberculosis cosmids. In addition, Tb488 was found to have 100% homology to M. tuberculosis topoisomerase I.

[0059] Seventeen overlapping peptides to the open reading frames ORF-1 (referred to as 1-1-1-17; SEQ ID NO: 34-50, respectively) and sixteen overlapping peptides to the open reading frame ORF-2 (referred to as 2-1-2-16, SEQ ID NO: 51-66) were synthesized using the procedure described below in Example 3.

[0060] The ability of the synthetic peptides, and of recombinant ORF-1 and ORF-2, to induce T cell proliferation and IFN-γ production in PBMC from PPD-positive donors was assayed as described below in Example 4. FIGS. 4A-B illustrate stimulation of T cell proliferation and IFN-γ production, respectively, by recombinant ORF-2 (referred to as MTI) and the synthetic peptides 2-1-2-16 for a donor referred to as D7. FIGS. 5A-B illustrate stimulation of T cell proliferation and IFN-γ, respectively, for the donor D160 by recombinant ORF-2 and other known M. tuberculosis antigens. Recombinant ORF-2 stimulated T cell proliferation and IFN-γ production in PBMC from both donors. The amount of PBMC stimulation seen with the individual synthetic peptides varied with each donor, indicating that each donor recognizes different epitopes on ORF2.

[0061] Similarly, two overlapping peptides (SEQ ID NO: 68 and 69) to the open reading frame of Tb224 were synthesized and shown to induce T cell proliferation and IFN-γ production in PBMC from PPD-positive donors.

[0062] In subsequent studies, the above method was used to screen an M. tuberculosis genomic DNA library prepared as follows. Genomic DNA from M. tuberculosis Erdman strain was randomly sheared to an average size of 2 kb, and blunt ended with Klenow polymerase, followed by the addition of EcoRI adaptors. The insert was subsequently ligated into the Screen phage vector (Novagen, Madison, Wis.) and packaged in vitro using the PhageMaker extract (Novagen). The phage library (referred to as the Erd λScreen library) was amplified and a portion was converted into a plasmid expression library by an autosubcloning mechanism using the E. coli strain BM25.8 (Novagen). Plasmid DNA was purified from BM25.8 cultures containing the pSCREEN recombinants and used to transform competent cells of the expressing host strain BL21(DE3)pLysS. Transformed cells were aliquoted into 96 well microtiter plates with each well containing a pool size of approximately 50 colonies. Replica plates of the 96 well plasmid library format were induced with IPTG to allow recombinant protein expression. Following induction, the plates were centrifuged to pellet the E. coli which was used directly in T cell expression cloning of a T cell line prepared from a PPD-positive donor (donor 160) as described above. Pools containing E. coli expressing M. tuberculosis T cell antigens were subsequently broken down into individual colonies and reassayed in a similar fashion to identify positive hits.

[0063] Screening of the T cell line from donor 160 with one 96 well plate of the Erd λScreen library provided a total of nine positive hits. Previous experiments on the screening of the pBSK library described above with sera from donor 160 suggested that most or all of the positive clones would be TbH-9, Tb38-1 or MTI (disclosed in U.S. patent application Ser. No. 08/533,634) or variants thereof. However, Southern analysis revealed that only three wells hybridized with a mixed probe of TbH-9, Tb38-1 and MTI. Of the remaining six positive wells, two were found to be identical.

EXAMPLE 3 Synthesis of Synthetic Polypeptides

[0064] Polypeptides may be synthesized on a Millipore 9050 peptide synthesizer using FMOC chemistry with HPTU (O-Benzotriazole-N,N,N′,N′-tetramethyluronium hexafluorophosphate) activation. A Gly-Cys-Gly sequence may be attached to the amino terminus of the peptide to provide a method of conjugation or labeling of the peptide. Cleavage of the peptides from the solid support may be carried out using the following cleavage mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After cleaving for 2 hours, the peptides may be precipitated in cold methyl-t-butyl-ether. The peptide pellets may then be dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized prior to purification by C18 reverse phase HPLC. A gradient of 0-60% acetonitrile (containing 0.1% TFA) in water (containing 0.1% TFA) may be used to elute the peptides. Following lyophilization of the pure fractions, the peptides may be characterized using electrospray mass spectrometry and by amino acid analysis.

EXAMPLE 4 Induction of T Cell Proliferation abd Interferon-γ Production by M. Tuberculosis Antigens

[0065] The ability of recombinant M. tuberculosis antigens and synthetic peptides to induce T cell proliferation and interferon-γ production may be determined as follows.

[0066] Proteins may be induced by IPTG and purified by gel elution, as described in Skeiky et al. J. Exp. Med., 1995, 181:1527-1537. The purified polypeptides are then screened for the ability to induce T-cell proliferation in PBMC preparations. The PBMCs from donors known to be PPD skin test positive and whose T-cells are known to proliferate in response to PPD, are cultured in medium comprising RPMI 1640 supplemented with 10% pooled human serum and 50 μg/ml gentamicin. Purified polypeptides are added in triplicate at concentrations of 0.5 to 10 μg/mL. After five days of culture in 96-well round-bottom plates in a volume of 200 μl, 50 μl of medium is removed from each well for determination of IFN-γ levels, as described below. The plates are then pulsed with 1 μCi/well of tritiated thymidine for a further 18 hours, harvested and tritium uptake determined using a gas scintillation counter. Fractions that result in proliferation in all three replicates three fold greater than the proliferation observed in cells cultured in medium alone are considered positive.

[0067] IFN-γ is measured using an enzyme-linked immunosorbent assay (ELISA). ELISA plates are coated with a mouse monoclonal antibody directed to human IFN-γ (Chemicon, Temecula, Calif.) in PBS for four hours at room temperature. Wells are then blocked with PBS containing 5% (W/V) non-fat dried milk for 1 hour at room temperature. The plates are washed six times in PBS/0.2% TWEEN-20 and samples diluted 1:2 in culture medium in the ELISA plates are incubated overnight at room temperature. The plates are again washed and a polyclonal rabbit anti-human IFN-γ serum diluted 1:4000 in PBS/10% normal goat serum is added to each well. The plates are then incubated for two hours at room temperature, washed and horseradish peroxidase-coupled anti-rabbit IgG (Jackson Immunoresearch Laboratories, West Grove, Pa.) is added at a 1:4000 dilution in PBS/5% non-fat dried milk. After a further two hour incubation at room temperature, the plates are washed and TMB substrate added. The reaction is stopped after 20 min with 1 N sulfuric acid. Optical density is determined at 450 nm using 570 nm as a reference wavelength. Fractions that result in both replicates giving an OD two fold greater than the mean OD from cells cultured in medium alone, plus 3 standard deviations, are considered positive.

[0068] From the foregoing, it will be appreciated that, although specific embodiments of the invention have been described herein for the purpose of illustration, various modifications may be made without deviating from the spirit and scope of the invention.

1 69 1886 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 1 CGCTCTGGTG ACCACCAACT TCTTCGGTGT CAACACCATC CCGATCGCCC TCAACGAGGC 60 CGACTACCTG CGCATGTGGA TCCAGGCCGC CACCGTCATG AGCCACTATC AAGCCGTCGC 120 GCACGAAATC TGGTGTCTCC ATGAATANGC CAGTTCGGGA AAGCCGTGGG CCAGTATCAC 180 CACGGGTGCG CCGGGCTCAC CGGCCTCGAC CACTCGCAGT CGCACGCCGT TGGTATCAAC 240 TAACCGTNCN GTANGTGCGC CCATCGTCTC ACCAAATCAC ACCGGGCACC GGCCTGAGAA 300 GGGCTTGGGG AGCANCCAGA GGCGATTGTC GCGGGTGCTG CCGCGCATCA TTGATCGGCC 360 GGCCGGACCA NTCGGGCCTC CCTTGACGTC CGGATCNCAC TTCCTGTGCA GCTGGCATGG 420 CTACAGCTCA CAGTGACTGC CCCACGATTG CCGGCCAGGT CCAGTTCAAA TTCCGGTGAA 480 TTCGCGGACA AAAGCAGCAG GTCAACCAAC CGCAGTCAGT CGAGGGTCCC AAACGTGAGC 540 CAATCGGTGA AATGGCTTGC TGCAGTGACA CCGGTCACAG GCTTAGCCGA CAGCACCGGA 600 ATAGCTCAGG CGGGCTATAG AGTCCTATAG AAACATTTGC TGATAGAATT AACCGCTGTC 660 TTGGCGTGAT CTTGATACGG CTCGCCGTGC GACCGGTTGG CTCAGTAGCT GACCACCATG 720 TAACCCATCC TCGGCAGGTG TCTACTAAGG CGAGACACCG CATTGGTGGG GCTGCATCGC 780 AAATCGGTCC GAGCATGTAG CACTGCCGTT ATCCCGGGAT AGCAAACCAC CCGGAACCAG 840 GGCTATCCCA GTCGCTCTCC GACGGAGGCC GTTTCGCTTT CCGTTGCCCG ATAACTCCCG 900 AGTGGATATC GGCGTTATCA NATTCAGGCT TTTCTTCGCA AGGTACCGGT GTTCGCTATA 960 TTCGGATATC TCGGACGGAT AATTACTAAA ACTTCAGTGG TTTAGATAAG GCCGCCGCAA 1020 TACTTCGCCG ATCTTGCCGA GCGCAACGGA TTTCCATCGT CGGTTTTCGT CGCCTTATCA 1080 AACATGATCG GAGATAATGA CAGATCGGCC TAGCTAGGTG TTTAGCGGAC GCGATTTAGG 1140 ACAACCGAGA TTTGCTTTGC CTCGCAACCA TGAGAGCGCC CCGCTTCGAC GCCGAATCGG 1200 GTGAGTGATG GTGGGTTAGC ACAGCCCTGA TTGCGCCACC GGCGAGGTGA TTGTGCCCGC 1260 CACGAGGCCG CCGCCGGCTA GCCCCATGAG CACGNTATAT AGACTCTCCT GCAACAGATC 1320 TCATACCGAT CGAAGGCGAA GCGCAGGCAT CGACGTCGGA GACACTGCCT TGGGATCGCG 1380 CCGCCTACAC GGCGGTTGGC GCATTGTCGC AGCGCAGTTG CAGGAGGGCA AATGTGCGCA 1440 GACGATGTAG TCGACAACAA GTGNACATGC CGTCTTCACG AACTCAAAAC TGACGATCTG 1500 CTTAGCATGA AAAAAACTGT TGACATCGGC CAAGCATGAC AGCCAGACTG TAGGCCTACG 1560 CGTGCAATGC AGAACCAAGG NTATGCATGG AATCGACGAC CGTTGAGATA GGCGGCAGGC 1620 ATGAGCAGAG CGTTCATCAT CGATCCAACG ATCAGTGCCA TTGACGGCTT GTACGACCTT 1680 CTGGGGATTG GAATACCCAA CCAAGGGGGT ATCCTTTACT CCTCACTAGA GTACTTCGAA 1740 AAAGCCCTGG AGGAGCTGGC AGCAGCGTTT CCGGGTGATG GCTGGTTAGG TTCGGCCGCG 1800 GACAAATACG CCGGCAAAAA CCGCAACCAC GTGAATTTTT TCCAGGAACT GGCAGACCTC 1860 GATCGTCAGC TCATCAGCCT GATCCA 1886 2305 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 2 GGCACGCGCT GGCCGCGCAA TACACCGAAA TTGCAACGGA ACTCGCAAGC GTGCTCGCTG 60 CGGTGCAGGC AAGCTCGTGG CAGGGGCCCA GCGCCGACCG GTTCGTCGTC GCCCATCAAC 120 CGTTCCGGTA TTGGCTAACC CACGCTGCCA CGGTGGCCAC CGCAGCAGCC GCCGCGCACN 180 AAACGGCCGC CGCCGGGTAT ACGTCCGCAT TGGGGGGCAT GCCTACGCTA GCCGAGTTGG 240 CGGCCAACCA TGCCATGCAC GGCGCTCTGG TGACCACCAA CTTCTTCGGT GTCAACACCA 300 TCCCGATCGC CCTCAACGAG GCCGACTACC TGCGCATGTG GATCCAGGCC GCCACCGTCA 360 TGAGCCACTA TCAAGCCGTC GCGCACGAAA GCGTGGCGGC GACCCCCAGC ACGCCGCCGG 420 CGCCGCAGAT AGTGACCAGT GCGGCCAGCT CGGCGGCTAG CAGCAGCTTC CCCGACCCGA 480 CCAAATTGAT CCTGCAGCTA CTCAAGGATT TCCTGGAGCT GCTGCGCTAT CTGGCTGTTG 540 AGCTGCTGCC GGGGCCGCTC GGCGACCTCA TCGCCCAGGT GTTGGACTGG TTCATCTCGT 600 TCGTGTCCGG TCCAGTCTTC ACGTTTCTCG CCTACCTGGT GCTGGACCCA CTGATCTATT 660 TCGGACCGTT CGCCCCGCTG ACGAGTCCGG TCCTGTTGCC TGCTGTGGAG TTACGCAACC 720 GCCTCAAAAC CGCCACCGGA CTGACGCTGC CACCTACCGT GATTTTCGAT CATCCCACTC 780 CCACTGCGGT CGCCGAGTAT GTCGCCCAGC AAATGTCTGG CAGCCGCCCA ACGGAATCCG 840 GTGATCCGAC GTCGCAGGTT GTCGAACCCG CTCGTGCCGA ATTCGGCACG AGTGCTGTTC 900 ATCAAATCCC CCCGAGACCT GCGGACACCC GGCGCGCTTG CCGACATCGA GATGATGTCC 960 CGCGAGATAG CAGAATTGCC CAACATCGTG ATGGTGCGGG GCTTGACCCG ACCGAACGGG 1020 GAACCTCTGA AGGAGACCAA GGTCTCGTTT CAGGCTGGTG AAGTGGGCGG CAAGCTCGAC 1080 GAAGCGACCA CCCTGCTCGA AGAGCACGGA GGCGAGCTGG ACCAGCTGAC CGGCGGTGCG 1140 CACCAGTTGG CCGACGCCCT CGCCCAAATA CGCAACGAAA TCAATGGGGC CGTGGCCAGC 1200 TCGAGCGGGA TAGTCAACAC CCTGCAGGCC ATGATGGACC TGATGGGCGG TGACAAGACC 1260 ATCCGACAAC TGGAAAATGC GTCCCAATAT GTCGGGCGCA TGCGGGCTCT GGGGGACAAT 1320 CTGAGCGGGA CCGTCACCGA TGCCGAACAA ATCGCCACTT GGGCCAGCCC TATGGTCAAC 1380 GCCCTCAACT CCAGCCCGGT GTGTAACAGC GATCCCGCCT GTCGGACGTC GCGCGCACAG 1440 TTGGCGGCGA TTGTCCAGGC GCAGGACGAC GGCCTGCTCA GGTCCATCAG AGCGCTAGCC 1500 GTCACCCTGC AACAGACGCA GGAATACCAG ACACTCGCCC GGACGGTGAG CACACTGGAC 1560 GGGCAACTGA AGCAAGTCGT CAGCACCCTC AAAGCGGTCG ACGGCCTACC CACCAAATTG 1620 GCTCAAATGC AGCAAGGAGC CAACGCTCTC GCCGACGGCA GCGCAGCGCT GGCGGCAGGC 1680 GTGCAGGAAT TGGTCGATCA GGTCAAAAAG ATGGGCTCAG GGCTCAACGA GGCCGCCGAC 1740 TTCCTGTTGG GGATCAAGCG GGATGCGGAC AAGCCGTCAA TGGCGGGCTT CAACATTCCA 1800 CCGCAGATTT TTTCGAGGGA CGAGTTCAAG AAGGGCGCCC AGATTTTCCT GTCGGCCGAT 1860 GGTCATGCGG CGCGGTACTT CGTGCAGAGC GCGCTGAATC CGGCCACCAC CGAGGCGATG 1920 GATCAGGTCA ACGATATCCT CCGTGTTGCG GATTCCGCGC GACCGAATAC CGAACTCGAG 1980 GATGCCACGA TAGGTCTGGC GGGGGTTCCG ACTGCGCTGC GGGATATCCG CGACTACTAC 2040 AACAGCGATA TGAAATTCAT CGTCATTGCG ACGATCGTTA TCGTATTCTT GATTCTCGTC 2100 ATTCTGNTGC GCGCACTTGT GGNTCCGATA TATCTGATAG GCTCGGTGCT GATTTCTTAC 2160 TTGTCGGCCC TAGGCATAGG AACTTTCGTT TTCCAATTGA TACTGGGCCA GGAAATGCAT 2220 TGGAGCCTGC CGGGACTGTC CTTCATATTA TTGGTTGCCA TCGGCGCTGA CTACAACATG 2280 CTGCTCATTT CACGCATCCG CGACG 2305 1742 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 3 CCGCTCTCTT TCAACGTCAT AAGTTCGGTG GGCCAGTCGG CCGCGCGTGC ATATGGCACC 60 AATAACGCGT GTCCCATGGA TACCCGGACC GCACGACGGT AGAGCGGATC AGCGCAGCCG 120 GTGCCGAACA CTACCGCGTC CACGCTCAGC CCTGCCGCGT TGCGGAAGAT CGAGCCCAGG 180 TTCTCATGGT CGTTAACGCC TTCCAACACT GCGACGGTGC GCGCCCCGGC GACCACCTGA 240 GCAACGCTCG GCTCCGGCAC CCGGCGCGCG GCTGCCAACA CCCCACGATT GAGATGGAAG 300 CCGATCACCC GTGCCATGAC ATCAGCCGAC GCTCGATAGT ACGGCGCGCC GACACCGGCC 360 AGATCATCCT TGAGCTCGGC CAGCCGGCGG TCGGTGCCGA ACAGCGCCAG CGGCGTGAAC 420 CGTGAGGCCA GCATGCGCTG CACCACCAGC ACACCCTCGG CGATCACCAA CGCCTTGCCG 480 GTCGGCAGAT CGGGACNACN GTCGATGCTG TTCAGGTCAC GGAAATCGTC GAGCCGTGGG 540 TCGTCGGGAT CGCAGACGTC CTGAACATCG AGGCCGTCGG GGTGCTGGGC ACAACGGCCT 600 TCGGTCACGG GCTTTCGTCG ACCAGAGCCA GCATCAGATC GGCGGCGCTG CGCAGGATGT 660 CACGCTCGCT GCGGTTCAGC GTCGCGAGCC GCTCAGCCAG CCACTCTTGC AGAGAGCCGT 720 TGCTGGGATT AATTGGGAGA GGAAGACAGC ATGTCGTTCG TGACCACACA GCCGGAAGCC 780 CTGGCAGCTG CGGCGGCGAA CCTACAGGGT ATTGGCACGA CAATGAACGC CCAGAACGCG 840 GCCGCGGCTG CTCCAACCAC CGGAGTAGTG CCCGCAGCCG CCGATGAAGT ATCAGCGCTG 900 ACCGCGGCTC AGTTTGCTGC GCACGCGCAG ATGTACCAAA CGGTCAGCGC CCAGGCCGCG 960 GCCATTCACG AAATGTTCGT GAACACGCTG GTGGCCAGTT CTGGCTCATA CGCGGCCACC 1020 GAGGCGGCCA ACGCAGCCGC TGCCGGCTGA ACGGGCTCGC ACGAACCTGC TGAAGGAGAG 1080 GGGGAACATC CGGAGTTCTC GGGTCAGGGG TTGCGCCAGC GCCCAGCCGA TTCAGNTATC 1140 GGCGTCCATA ACAGCAGACG ATCTAGGCAT TCAGTACTAA GGAGACAGGC AACATGGCCT 1200 CACGTTTTAT GACGGATCCG CATGCGATGC GGGACATGGC GGGCCGTTTT GAGGTGCACG 1260 CCCAGACGGT GGAGGACGAG GCTCGCCGGA TGTGGGCGTC CGCGCAAAAC ATTTCCGGTG 1320 CGGGCTGGAG TGGCATGGCC GAGGCGACCT CGCTAGACAC CATGACCTAG ATGAATCAGG 1380 CGTTTCGCAA CATCGTGAAC ATGCTGCACG GGGTGCGTGA CGGGCTGGTT CGCGACGCCA 1440 ACAANTACGA ACAGCAAGAG CAGGCCTCCC AGCAGATCCT GAGCAGNTAG CGCCGAAAGC 1500 CACAGCTGNG TACGNTTTCT CACATTAGGA GAACACCAAT ATGACGATTA ATTACCAGTT 1560 CGGGGACGTC GACGCTCATG GCGCCATGAT CCGCGCTCAG GCGGCGTCGC TTGAGGCGGA 1620 GCATCAGGCC ATCGTTCGTG ATGTGTTGGC CGCGGGTGAC TTTTGGGGCG GCGCCGGTTC 1680 GGTGGCTTGC CAGGAGTTCA TTACCCAGTT GGGCCGTAAC TTCCAGGTGA TCTACGAGCA 1740 GG 1742 2836 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 4 GTTGATTCCG TTCGCGGCGC CGCCGAAGAC CACCAACTCC GCTGGGGTGG TCGCACAGGC 60 GGTTGCGTCG GTCAGCTGGC CGAATCCCAA TGATTGGTGG CTCNGTGCGG TTGCTGGGCT 120 CGATTACCCC CACGGAAAGG ACGACGATCG TTCGTTTGCT CGGTCAGTCG TACTTGGCGA 180 CGGGCATGGC GCGGTTTCTT ACCTCGATCG CACAGCAGCT GACCTTCGGC CCAGGGGGCA 240 CAACGGCTGG CTCCGGCGGA GCCTGGTACC CAACGCCACA ATTCGCCGGC CTGGGTGCAG 300 GCCCGGCGGT GTCGGCGAGT TTGGCGCGGG CGGAGCCGGT CGGGAGGTTG TCGGTGCCGC 360 CAAGTTGGGC CGTCGCGGCT CCGGCCTTCG CGGAGAAGCC TGAGGCGGGC ACGCCGATGT 420 CCGTCATCGG CGAAGCGTCC AGCTGCGGTC AGGGAGGCCT GCTTCGAGGC ATACCGCTGG 480 CGAGAGCGGG GCGGCGTACA GGCGCCTTCG CTCACCGATA CGGGTTCCGC CACAGCGTGA 540 TTACCCGGTC TCCGTCGGCG GGATAGCTTT CGATCCGGTC TGCGCGGCCG CCGGAAATGC 600 TGCAGATAGC GATCGACCGC GCCGGTCGGT AAACGCCGCA CACGGCACTA TCAATGCGCA 660 CGGCGGGCGT TGATGCCAAA TTGACCGTCC CGACGGGGCT TTATCTGCGG CAAGATTTCA 720 TCCCCAGCCC GGTCGGTGGG CCGATAAATA CGCTGGTCAG CGCGACTCTT CCGGCTGAAT 780 TCGATGCTCT GGGCGCCCGC TCGACGCCGA GTATCTCGAG TGGGCCGCAA ACCCGGTCAA 840 ACGCTGTTAC TGTGGCGTTA CCACAGGTGA ATTTGCGGTG CCAACTGGTG AACACTTGCG 900 AACGGGTGGC ATCGAAATCA ACTTGTTGCG TTGCAGTGAT CTACTCTCTT GCAGAGAGCC 960 GTTGCTGGGA TTAATTGGGA GAGGAAGACA GCATGTCGTT CGTGACCACA CAGCCGGAAG 1020 CCCTGGCAGC TGCGGCGGCG AACCTACAGG GTATTGGCAC GACAATGAAC GCCCAGAACG 1080 CGGCCGCGGC TGCTCCAACC ACCGGAGTAG TGCCCGCAGC CGCCGATGAA GTATCAGCGC 1140 TGACCGCGGC TCAGTTTGCT GCGCACGCGC AGATGTACCA AACGGTCAGC GCCCAGGCCG 1200 CGGCCATTCA CGAAATGTTC GTGAACACGC TGGTGGCCAG TTCTGGCTCA TACGCGGCCA 1260 CCGAGGCGGC CAACGCAGCC GCTGCCGGCT GAACGGGCTC GCACGAACCT GCTGAAGGAG 1320 AGGGGGAACA TCCGGAGTTC TCGGGTCAGG GGTTGCGCCA GCGCCCAGCC GATTCAGCTA 1380 TCGGCGTCCA TAACAGCAGA CGATCTAGGC ATTCAGTACT AAGGAGACAG GCAACATGGC 1440 CTCACGTTTT ATGACGGATC CGCATGCGAT GCGGGACATG GCGGGCCGTT TTGAGGTGCA 1500 CGCCCAGACG GTGGAGGACG AGGCTCGCCG GATGTGGGCG TCCGCGCAAA ACATTTCCGG 1560 TGCGGGCTGG AGTGGCATGG CCGAGGCGAC CTCGCTAGAC ACCATGACCT AGATGAATCA 1620 GGCGTTTCGC AACATCGTGA ACATGCTGCA CGGGGTGCGT GACGGGCTGG TTCGCGACGC 1680 CAACAACTAC GAACAGCAAG AGCAGGCCTC CCAGCAGATC CTGAGCAGCT AGCGCCGAAA 1740 GCCACAGCTG CGTACGCTTT CTCACATTAG GAGAACACCA ATATGACGAT TAATTACCAG 1800 TTCGGGGACG TCGACGCTCA TGGCGCCATG ATCCGCGCTC AGGCGGCGTC GCTTGAGGCG 1860 GAGCATCAGG CCATCGTTCG TGATGTGTTG GCCGCGGGTG ACTTTTGGGG CGGCGCCGGT 1920 TCGGTGGCTT GCCAGGAGTT CATTACCCAG TTGGGCCGTA ACTTCCAGGT GATCTACGAG 1980 CAGGCCAACG CCCACGGGCA GAAGGTGCAG GCTGCCGGCA ACAACATGGC GCAAACCGAC 2040 AGCGCCGTCG GCTCCAGCTG GGCCTAAAAC TGAACTTCAG TCGCGGCAGC ACACCAACCA 2100 GCCGGTGTGC TGCTGTGTCC TGCAGTTAAC TAGCACTCGA CCGCTGAGGT AGCGATGGAT 2160 CAACAGAGTA CCCGCACCGA CATCACCGTC AACGTCGACG GCTTCTGGAT GCTTCAGGCG 2220 CTACTGGATA TCCGCCACGT TGCGCCTGAG TTACGTTGCC GGCCGTACGT CTCCACCGAT 2280 TCCAATGACT GGCTAAACGA GCACCCGGGG ATGGCGGTCA TGCGCGAGCA GGGCATTGTC 2340 GTCAACGACG CGGTCAACGA ACAGGTCGCT GCCCGGATGA AGGTGCTTGC CGCACCTGAT 2400 CTTGAAGTCG TCGCCCTGCT GTCACGCGGC AAGTTGCTGT ACGGGGTCAT AGACGACGAG 2460 AACCAGCCGC CGGGTTCGCG TGACATCCCT GACAATGAGT TCCGGGTGGT GTTGGCCCGG 2520 CGAGGCCAGC ACTGGGTGTC GGCGGTACGG GTTGGCAATG ACATCACCGT CGATGACGTG 2580 ACGGTCTCGG ATAGCGCCTC GATCGCCGCA CTGGTAATGG ACGGTCTGGA GTCGATTCAC 2640 CACGCCGACC CAGCCGCGAT CAACGCGGTC AACGTGCCAA TGGAGGAGAT CTCGTGCCGA 2700 ATTCGGCACG AGGCACGAGG CGGTGTCGGT GACGACGGGA TCGATCACGA TCATCGACCG 2760 GCCGGGATCC TTGGCGATCT CGTTGAGCAC GACCCGGGCC CGCGGGAAGC TCTGCGACAT 2820 CCATGGGTTC TTCCCG 2836 900 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 5 AACATGCTGC ACGGGGTGCG TGACGGGCTG GTTCGCGACG CCAACAACTA CGAGCAGCAA 60 GAGCAGGCCT CCCAGCAGAT CCTCAGCAGC TAACGTCAGC CGCTGCAGCA CAATACTTTT 120 ACAAGCGAAG GAGAACAGGT TCGATGACCA TCAACTATCA GTTCGGTGAT GTCGACGCTC 180 ACGGCGCCAT GATCCGCGCT CAGGCCGGGT TGCTGGAGGC CGAACATCAG GCCATCATTC 240 GTGATGTGTT GACCGCGAGT GACTTTTGGG GCGGCGCCGG TTCGGCGGCC TGCCAGGGGT 300 TCATTACCCA ATTGGGCCGT AACTTCCAGG TGATCTACGA ACAGGCCAAC GCCCACGGGC 360 AGAAGGTGCA GGCTGCCGGC AACAACATGG CGCAAACCGA CAGCGCCGTC GGCTCCAGCT 420 GGGCCTGACA CCAGGCCAAG GCCAGGGACG TGGTGTACGA GTGAAGGTTC CTCGCGTGAT 480 CCTTCGGGTG GCAGTCTAGG TGGTCAGTGC TGGGGTGTTG GTGGTTTGCT GCTTGGCGGG 540 TTCTTCGGTG CTGGTCAGTG CTGCTCGGGC TCGGGTGAGG ACCTCGAGGC CCAGGTAGCG 600 CCGTCCTTCG ATCCATTCGT CGTGTTGTTC GGCGAGGACG GCTCCGACGA GGCGGATGAT 660 CGAGGCGCGG TCGGGGAAGA TGCCCACGAC GTCGGTTCGG CGTCGTACCT CTCGGTTGAG 720 GCGTTCCTGG GGGTTGTTGG ACCAGATTTG GCGCCAGATC TTCTTGGGGA AGGCGGTGAA 780 CGCCAGCAGG TCGGTGCGGG CGGTGTCGAN GTGCTCGGCC ACCGCGGGGA GTTTGTCGGT 840 CAGAGCGTCG AGTACCCGAT CATATTGGGC AACAACTGAT TCGGCGTTGG GCTGGTCGTA 900 1905 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 6 GCTCGCCGGA TGTGGGCGTC CGCGCAAAAC ATTTCCGGTG CGGGCTGGAG TGGCATGGCC 60 GAGGCGACCT CGCTAGACAC CATGGCCCAG ATGAATCAGG CGTTTCGCAA CATCGTGAAC 120 ATGCTGCACG GGGTGCGTGA CGGGCTGGTT CGCGACGCCA ACAACTACGA GCAGCAAGAG 180 CAGGCCTCCC AGCAGATCCT CAGCAGCTAA CGTCAGCCGC TGCAGCACAA TACTTTTACA 240 AGCGAAGGAG AACAGGTTCG ATGACCATCA ACTATCAGTT CGGTGATGTC GACGCTCACG 300 GCGCCATGAT CCGCGCTCAG GCCGGGTTGC TGGAGGCCGA GCATCAGGCC ATCATTCGTG 360 ATGTGTTGAC CGCGAGTGAC TTTTGGGGCG GCGCCGGTTC GGCGGCCTGC CAGGGGTTCA 420 TTACCCAGTT GGGCCGTAAC TTCCAGGTGA TCTACGAACA AGCCAACACC CACGGGCAGA 480 AGGTGCAAGC TGCCGGCAAC AACATGGCGC AAACCGACAG CGCCGTCNGC TCCAGCTGGG 540 CCTGACACCA GGCCAAGGCC AGGGACGTGG TGTACNAGTG AAGGTTCCTC GCGTGATCCT 600 TCGGGTGGCA GTCTAGGTGG TCAGTGCTGG GGTGTTGGTG GTTTGCTGCT TGGCGGGTTC 660 TTCGGTGCTG GTCAGTGCTG CTCGGGCTCG GGTGAGGACC TCGAGGCCCA GGTAGCGCCG 720 TCCTTCGATC CATTCGTCGT GTTGTTCGGC GAGGACNGCT CCGACGANGC GGATGATCGA 780 GGCGCGGTCG GGGAAGATGC CCACGACGTC GGTTCGGCGT CGTACCTCTC GGTTGAAGCG 840 TTCCTGGGGG CCACCGCTTG GCGCCNANGC ACTCCACGCC AATTCGTCNC ACCTAACAGC 900 GGTGGCCAAC GACTATGACT ACGACACCGT TTTTGCCAGG GCCCTCNAAA GGATCTGCGC 960 GTCCCGGCGA CACGCTTTTT GCGATAAGTA CCTCCGGCAA TTCTATGAGT GTACTGCGGN 1020 CCGCGAAAAC CGCAAGGGAG TTGGGTGTGA CGGTTNTTGC AAATGACGGG CGAATCCGGC 1080 GGCCAGCTGG CAGAATTCGC AGATTTCTTG ATCAACGTCC CGTCACGCGA CACCGGGCGA 1140 ATCCAGGAAT CTCACATCGT TTTTATTCAT GCGATCTCCG AACATGTCGA ACACGCGCTT 1200 TTCGCGCCTC GCCAATAGGA AAGCCGATCC TTACGCGGCC ATTCGAAAGA TGGTCGCGGA 1260 ACGTGCGGGA CACCAATGGT GTCTCTTCCT CGATAGAGAC GGGGTCATCA ATCGACAAGT 1320 GGTCGGCGAC TACGTACGGA ACTGGCGGCA GTTTGAATGG TTGCCCGGGG CGGCGCGGGC 1380 GTTGAAGAAG CTACGGGCAT GGGCTCCGTA CATCGTTGTC GTGACAAACC AGCAGGGCGT 1440 GGGTGCCGGA TTGATGAGCG CCGTCGACGT GATGGTGATA CATCGGCACC TCCAAATGCA 1500 GCTTGCATCC GATGGCGTGC TGATAGATGG ATTTCAGGTT TGCCCGCACC ACCGTTCGCA 1560 GCGGTGTGGC TGCCGTAAGC CGAGACCGGG TCTGGTCCTC GACTGGCTCG GACGACACCC 1620 CGACAGTGAG CCATTGCTGA GCATCGTGGT TGGGGACAGC CTCAGCGATC TTGACATTGG 1680 CACACAACGT CGCCGCTGCT GCCGGTGCAT GTGCCAGTGT CCAGATAGGG GGCGCCAGTT 1740 CTGGCGGTGT CGCTGACGCG TCATTTGACT CGCTCTGGGA GTTCGCTGTC GCAGTCGGAC 1800 ATGCGCGGGG GGAGCGGGGC TAATGGCGAT CTTGCGCGGG CGAGCGCCGT NGCGGNTCGG 1860 ACTNNGCGGT GGCGGGACAG ACGTGGAACC GTACTCGAGC CAGTT 1905 2921 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 7 CGGGATGCCG TGGTGGTTGG TATTGCCCAA ACCCTGGCGC TGGTCCCCGG GGTATCCAGG 60 TCCGGGTCGA CCATCAGCGC TGGACTGTTT CTCGGACTCG ACCGTGAACT GGCCGCCCGA 120 TTCGGATTCC TGCTGGCCAT TCCAGCGGTG TTCGCCTCCG GGTTGTTCTC GTTGCCCGAC 180 GCATTCCACC CGGTAACCGA GGGCATGAGC GCTACTGGCC CGCAGTTGCT GGTGGCCACC 240 CTGATCGCGT TCGTCCTCGG TCTGACCGCG GTGGCCTGGC TGCTGCGGTT TCTGGTGCGA 300 CACAACATGT ACTGGTTCGT CGGCTACCGG GTGCTCGTCG GGACGGGCAT GCTCGTGCTG 360 CTGGCTACCG GGACGGTAGC CGCGACATGA CCGTCATCTT GCTACGCCAT GCCCGTTCCA 420 CCTCGAACAC CGCGGGCGTG CTGGCCGGCC GGTCCGGCGT CGACCTCGAC GAGAAGGGGC 480 GCGAGCAGGC CACCGGGTTG ATCGATCGAA TTGGTGACCT GCCGATCCGG GCGGTCGCGT 540 CTTCTCCAAT GCTGCGGTGT CAACGCACCG TCGAACCGCT GGCCGAGGCG CTGTGCCTGG 600 AGCCGCTCAT CGATGACCGG TTCTCCGAAG TCGACTACGG CGAATGGACT GGCAGAAAAA 660 TCGGTGACCT GGTCGACGAG CCGTTGTGGC GGGTAGTCCA GGCCCACCCC AGCGCGGCGG 720 TGTTTCCCGG CGGTGAGGGT TTGGCGCAGG TGCAGACGTG GTTGTCCTGA CGGATTTCCA 780 TGCCGGGGAA CACCAAGACC GGATCGGCAC TGGCGGTCGC CGGCGAAAAC CCGGCCGCCA 840 ATAGGGCGAC CGTCGCTGCG AATGCGCGTG GTACCAGGCG GACCACCTTG AACTCCCATC 900 CGTCGGGGCC AAGCGCATCG CCCGCCGCCG GTTACGGCTA AGGCGTACCA AAACCCGACG 960 GTAATACTTC GGCAATGTCG GGTCNCGACG TTACCGAGAC GTGACCAGNG AGGCNGCGGC 1020 ATTGGATTTA TCGATGGTGC GCGGTTCCCA NCCCGGCGGT CCGAANACGT AGCCCAGCCG 1080 ATCCCGCAGA CGTGTTGCCG ACCGCCAGTC ACGCACGATC GCCACGTACT CGCGGGTCTG 1140 CAGCTTCCAG ATGTTGAACG TGTCGACCCG CTTGGTCAGG CCATAATGCG GTCGGAATAG 1200 CTCCGGCTGA AAGCTACCGA ACAGGCGGTC CCAGATGATG AGGATGCCGC CATAGTTCTT 1260 GTCCANATAC ACCGGGTCCA TTCCGTGGTG GACCCGGTGG TGCGACGGGG TATTGAAGAC 1320 GAATTCGAAC CACCGCGGCA GCCTGTCGAT CCGCTCGGTG TGCACCCAGA ACTGGTAGAT 1380 CAAGTTCAGC GACCAATTGC AGAACACCAT CCAAGGGGGA AGCCCCATCA GTGGCAGCGG 1440 AACCCACATG AGAATCTCGC CGCTGTTGTT CCANTTTCTG GCGCAGCGCG GTGGCGAAGT 1500 TGAAGTATTC GCTGGAGTGA TGCGCCTGGT GGGTAGCCCA GATCAGCCGA ACTCGGTGGG 1560 CGATGCGGTG ATAGGAGTAG TACAGCAGAT CGACACCAAC GATCGCGATC ACCCAGGTGT 1620 ACCACCGGTG GGCGGACAGC TGCCAGGGGG CAAGGTAGGC ATAGATTGCG GCATAACCGA 1680 GCAGGGCAAG GGACTTCCAG CCGGCGGTGG TGGCTATCGA AACCAGCCCC ATCGAGATGC 1740 TGGCCACCGA GTCGCGGGTG AGGTAAGCGC CCGAGGCGGG CCGTGGCTGC CCGGTAGCAG 1800 CGGTCTCGAT GCTTTCCAGC TTGCGGGCCG CCGTCCATTC GAGAATCAGC AGCAATAGAA 1860 AACATGGAAT GGCGAACAGT ACCGGGTCCC GCATTTCCTC GGGCAGCGCT GAGAAGAATC 1920 CGGCGACGGC ATGGCCGAGG CGACCTCGNT AGACACCATG ACCCAGATGA ATCAGGCGTT 1980 TCGCAACATC GTGAACATGC TGCACGGGGT GCGTGACGGG CTGGTTCGCG ACGCCAACAA 2040 NTACGAACAG CAAGAGCAGG CCTCCCAGCA GATCCTCAGC AGCTGACCCG GCCCGACGAC 2100 TCAGGAGGAC ACATGACCAT CAACTATCAA TTCGGGGACG TCGACGCTCA CGGCGCCATG 2160 ATCCGCGCTC AGGCCGGGTC GCTGGAGGCC GAGCATCAGG CCATCATTTC TGATGTGTTG 2220 ACCGCGAGTG ACTTTTGGGG CGGCGCCGGT TCGGCGGCCT GCCAGGGGTT CATTACCCAG 2280 CTGGGCCGTA ACTTCCAGGT GATNTACGAG CAGGCCAACG CCCACGGGCA GAAGGTGCAG 2340 GCTGCCGGCA ACAACATGGC ACAAACCGAC AGCGCCGTCG GCTCCAGCTG GGCATAAAGN 2400 TGGCTTAAGG CCCGCGCCGT CAATTACAAC GTGGCCGCAC ACCGGTTGGT GTGTGGCCAC 2460 GTTGTTATCT GAACGACTAA CTACTTCGAC CTGCTAAAGT CGGCGCGTTG ATCCCCGGTC 2520 GGATGGTGCT GAACTGGGAA GATGGCCTCA ATGCCCTTGT TGCGGAAGGG ATTGAGGCCA 2580 TCGTGTTTCG TACTTTAGGC GATCAGTGCT GGTTGTGGGA GTCGCTGCTG CCCGACGAGG 2640 TGCGCCGACT GCCCGAGGAA CTGGCCCGGG TGGACGCATT GTTGGACGAT CCGGCGTTCT 2700 TCGCCCCGTT CGTGCCGTTC TTCGACCCGC GCAGGGGCCG GCCGTCGACG CCGATGGAGG 2760 TCTATCTGCA GTTGATGTTT GTGAAGTTCC GCTACCGGCT GGGCTATGAG TCGCTGTGCC 2820 GGGAGGTGGC TGATTCGATC ACCTGACGGC GGTTTTGCCG CATTGCGCTG GACGGGTCGG 2880 TGCCGCATCC GACCACATTG ATGAAGCTCA CCACGCGTTG C 2921 1704 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 8 CGCGATCGTC GTCAACGANG TCGACCGTCA CCACGGACTG ATCAACAAGT TCGCAGGCGA 60 CGCCGCCCTG GCCATCTTCG GAGCCCCGAA CCGCCTCGAC CGTCCCGAAG ACGCCGCGCT 120 GGCCGCCGCC CGGGCCATAN CCGANCGGCT GGCCNACGAG ATGCCCGAGG TCCAAGCCGG 180 CATCGGGGTG GCGGCAGGCC ANATCGTCGC CGGCAATGTC GGCGCCAAGC AAAGATTCNA 240 ATACACAGTG GTCGGCAAGC CGGTCAACCA NGCGGCCCGA TTGTGCGAAC TGGCCAAATC 300 ACACCCCGCG CGATTGGGTC TCGCCCGCTC GGCTCATGGT CACCCAATTC AAGGACTACT 360 TTGGCCTGGC GCACGACCTG CCGAAGTGGG CGAGTGAAGG CGCCAAAGCC GCCGGTGAGG 420 CCGCCAAGGC GTTGCCGGCC GCCGTTCCGG CCATTCCGAG TGCTGGCCTG AGCGGCGTTG 480 CGGGCGCCGT CGGTCAGGCG GCGTCGGTCG GGGGATTGAA GGTTCCGGCC GTTTGGACCG 540 CCACGACCCC GGCGGCGAGC CCCGCGGTGC TGGCGGCGTC CAACGGCCTC GGAGCCGCGG 600 CCGCCGCTGA AGGTTCGACA CACGCGTTTG GCGGGATGCC GCTCATGGGT ANCGGTGCCG 660 GACGTGCGTT TAACAACTTC GCTGCCCCTC GATACGGATT CAAGCCGACC GTGATCGCCC 720 AACCGCCGGC TGGCGGATGA CCAACTACGT TCGTTGATCG AGGATCGAAT TCNACGATTC 780 AAAGGGAGGA ATTCATATGA CCTCNCGTTT TATGACGGAT CCGCACGCNA TNCGGGACAT 840 GGCGGGCCGT TTTGAGGTGC ACGCCCAGAC GGTGGAGGAC GAGGCTNGCN GGATGTGGGC 900 GTCCGCGCAA AACATTTCCG GTGCGGGCTG GAGTGGCATG GCCGAGGCGA CCTCGNTAGA 960 CACCATGGCC CAGATGAATC AGGCGTTTCN CAACATCGTG AACATGCTGC ACGGGGTGNG 1020 TGACGGGCTG GTTCGCGACG CCAACAACTA CGAACAGCAA GAGCAGGCCT CCCAGCAGAT 1080 CCTCAGCAGC TGACCCGGCC CGACGACTCA GGAGGACACA TGACCATCAA CTATCAATTC 1140 GGGGACGTCG ACGCTCATGG CGCCATGATC CGCGCTNTGG CCGGGTTGCT GGAGGCCGAG 1200 CATCAGGCCA TCATTTCTGA TGTGTTGACC GCGAGTGACT TTTGGGGCGG CGCCGGTTCG 1260 GCGGCCTGCC AGGGGTTCAT TACCCAGTTG GGCCGTAACT TCCAGGTGAT TTACGAGCAG 1320 GCCAACGCCC ACGGGCAGAA GGTGCAGGCT GCCGGCAACA ACATGGCACA AACCGACAGC 1380 GCCGTNGGNT CCAGCTGGGC CTAACCCGGG TCNTAAGTTG GGTCCGCGCA GGGCGGGCCG 1440 ATCAGCGTNG ACTTTGGCGC CCGATACACG GGCATNTTNT NGTCGGGAAC ACTGCGCCCG 1500 CGTCAGNTGC CCGCTTCCCC TTGTTNGGCG ACGTGCTCGG TGATGGCTTT GACGACCGCT 1560 TCGCCGGCGC GGCCAATCAA TTGGTCGCGC TTGCCTNTAG CCCATTCGTG CGACGCCCGC 1620 GGCGCCGCGA GTTGTCCCTT GAAATAAGGA ATCACAGCAC GGGCGAACAG CTCATAGGAG 1680 TGAAAGGTTG CCGTGGCGGG GCCC 1704 2286 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 9 CCGTCTTGGC GTCTGGGCGC ATTGTGATCT GGGCCANTTG CCCCTCCACC CAGACCGCGC 60 CCAGCTTGTC GATCCAGCCC GCGACCCGGA TTGCCACCGC GCGAACCGGG AACGGATTCT 120 CCGCTGAATT CTGGGTCACT TCGCAGTCGC GCGGGTGATC CTGTTGGCGA NCAGCGTCTG 180 GAACGGGCGT CNAACGCGTG CCGTAAGCCC AGCGTGTACG CCGTCAGCCC GACGCCGATG 240 CCGAATGCCT TGCCGCCCAA GCTGAGCCGC GCGGGCTCCA CCAAGAGCGT CACGGTGAGC 300 CAGCCAACCA GATGCAAGGC GACGATCACC GCGAAGTGCC GAATTCGGCA CGAGAGGTGC 360 TGGAAATCCA GCAATACGCC CGCGAGCCGA TCTCGTTGGA CCAGACCATC GGCGACGANG 420 GCGACAGNCA GCTTGGCGAT TTCATCGAAA ACAGCGAGGC GGTGGTGGNC GTCGACGCGG 480 TGTCCTTCAC TTTGCTGCAT GATCAACTGC ANTCGGTGCT GGACACGCTC TCCGAGCGTG 540 AGGCGGGCGT GGTGCGGCTA CGCTTCGGCC TTACCGACGG CCAGCCGCGC ACCCTTGACG 600 AGATCGGCCA GGTCTACGGC GTGACCCGGG AACGCATCCG CCAGATCGAA TCCAAGACTA 660 TGTCGAAGTT GCGCCATCCG AGCCGCTCAC AGGTCCTGCG CGACTATCGT GCCGAATTCG 720 GCACGAGCCG TTTTGAGGTG CACGCCCAGA CGGTGGAGGA CGAGGCTCGC CGGATGTGGG 780 CGTCCGCGCA AAACATTTCC GGTGCGGGCT GGAGTGGCAT GGCCGANGCG ACCTCGCTAG 840 ACACCATGGC CCAGATGAAT CAGGCGTTTC GCAACATCGT GAACATGCTG CACGGGGTGC 900 GTGACGGGCT GGTTCGCGAC GCCAACAACT ACGAACAGCA AGAGCAGGCC TCCCAGCAGA 960 TCCTCAGCAG CTGACCCGGC CCGACGACTC AGGAGGACAC ATGACCATCA ACTATCAATT 1020 CGGGGACGTC GACGCTCATG GCGCCATGAT CCGCGCTCTG GCCGGGTTGC TGGAGGCCGA 1080 GCATCAGGCC ATCATTTCTG ATGTGTTGAC CGCGAGTGAC TTTTGGGGCG GCGCCGGTTC 1140 GGCGGCCTGC CAGGGGTTCA TTACCCAGTT GGGCCGTAAC TTCCAGGTGA TCTACGAGCA 1200 GGCCAACGCC CACGGGCAGA AGGTGCAGGC TGCCGGCAAC AACATGGCAC AAACCGACAG 1260 CGCCGTCGGC TCCAGCTGGG CCTAACCCGG GTCCTAAGTT GGGTCCGCGC AGGGCGGGCC 1320 GATCAGCGTC GACTTTGGCG CCCGATACAC GGGCATGTNG TNGTCGGGAA CACTGCGCCC 1380 GCGTCAGCTG CCCGCTTCCC CTTGTTCGGC GACGTGCTCG GTGATGGCTT TGACGACCGC 1440 TTCGCCGGCG CGGCCAATCA ATTGGTCGCG CTTGCCTCTA GCCTCGTGCC GAATTCGGCA 1500 CGAGGGTGCT GGTGCCGCGC TATCGGCAGC ACGTGAGCTC CACGACGAAC TCATCCCAGT 1560 GCTGGGTTCC GCGGAGTTCG GCATCGGCGT GTCGGCCGGA AGGGCCATCG CCGGCCACAT 1620 CGGCGCTCAA GCCCGCTTCG AGTACACCGT CATCGGCGAC CCGGTCAACG AGGCCGCCCG 1680 GCTCACCGAA CTGGCCAAAG TCGAGGATGG CCACGTTCTG GCGTCGGCGA TCGCGGTCAG 1740 TGGCGCCCTG GACGCCGAAG CATTGTGTTG GGATGTTGGC GAGGTGGTTG AGCTCCGCGG 1800 ACGTGCTGCA CCCACCCAAC TAGCCAGGCC AATGAATNTG GCNGCACCCG AAGAGGTTTC 1860 CAGCGAAGTA CGCGGCTAGT CGCGCTTGGC TGCNTTCTTC GCCGGCACCT TCCGGGCAGC 1920 TTTCCTGGCT GGCCGTTTTG CCGGACCCCG GGCTCGGCGA TCGGCCAACA GCTCGGCGGC 1980 GCGCTCGTCG GTTATGGAAG CCACGTNGTC GCCCTTACGC AGGCTGGCAT TGGTCTCACC 2040 GTCGGTGACG TACGGCCCGA ATCGGCCGTC CTTGATGACC ATTGGCTTGC CAGACGCCGG 2100 ATNTGNTCCC AGCTCGCGCA GCGGCGGAGC CGAAGCGCTT TGCCGGCCAC GACNTTTCGG 2160 CTCTGNGTAG ATNTTCAGGG CTTCGTCGAG CGNGATGGTG AATATATGGT CTTCGGTGAC 2220 CAGTGATCGA GAATCGTTGC CGCGCTTTAG ATACGGTCNG TAGCGCCCGT TCTGCGCGGT 2280 GATNTC 2286 1136 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 10 GGGCATCTTC CCCGACCGCG CCTCGATCAT CCGCCTCGTC GGAGCCGTCC TCGCCGAACA 60 ACACGACGAA TGGATCGAAG GACGGCGCTA CCTGGGCCTC GAGGTCCTCA CCCGAGCCCG 120 AGCAGCACTG ACCAGCACCG AAGAACCGCC AAGCAGCAAA CCACCAACAC CCCAGCACTG 180 ACCACCTAGA CTGCCACCCG AAGGATCACG CGAGGAACCT TCACTCGTAC ACCACGTCCC 240 TGGCCTTGGC CTGGTGTCAG GCCCAGCTGG AGCCGACGGC GCTGTCGGTT TGCGCCATGT 300 TGTTGCCGGC AGCCTGCACC TTCTGCCCGT GGGCGTTGGC CTGCTCGTAG ATCACCTGGA 360 AGTTACGGCC CAACTGGGTA ATGAACCCCT GGCAGGCCGC CGAACCGGCG CCGCCCCAAA 420 AGTCACTCGC GGTCAACACA TCACGAATGA TGGCCTGATG CTCGGCCTCC AGCAACCCGG 480 CCTGAGCGCG GATCATGGCG CCGTGAGCGT CGACATCACC GAACTGATAG TTGATGGTCA 540 TCGAACCTGT TCTCCTTCGC TTGTAAAAGT ATTGTGCTGC AGCGGCTGAC GTTAGCTGCT 600 GAGGATCTGC TGGGAGGCCT GCTCTTGCCT CGTGCCGAAT TCGGCACGAG AGGCCGCCTT 660 CGAAGAAATC CTTTGAGAAT TCGCCAAGGC CGTCGACCCA GCATGGGGTC AGCTCGCCAG 720 CCGCGCCGGC TGGCAACCGT TCCCGCTCGA GAAAGACCTG GAGGAATACC AGTGACAAAC 780 GACCTCCCAG ACGTCCGAGA GCGTGACGGC GGTCCACGTC CCGCTCCTCC TGCTGGCGGG 840 CCACGCTTGT CAGACGTGTG GGTTTACAAC GGGCGGGCGT ACGACCTGAG TGAGTGGATT 900 TCCAAGCATC CCGGCGGCGC CTTNTTCATT GGGCGGACCA AGAACCGCGA CATCACCGCA 960 ATCGTCAAGT CCTACCATCG TGATCCGGCG ATTGTCGAGC GAATCCTGCA GCGGAGGTAC 1020 GCGTTGGGCC GCGACGCAAC CCCTAGGGAC ATCCACCCCA AGCACAATGC ACCGGCATTT 1080 CTGTTCAAAG ACGACTTCAA CAGCTGGCGG GACACCCCGA AGTATCGATT NGACGA 1136 967 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 11 TGAGCGCCAA CCCTACCGTC GGTTCGTCAC ACGGACCGCA TGGCCTGCTC CGCGGACTGC 60 CGCTAGGGTC GCGGATCACT CGGCGTAGCG GCGCCTTTGC CCACCGATAT GGGTTCCGTC 120 ACAGTGTGGT TGCCCGCCCG CCATCGGCCG GATAACGCCA TGACCTCAGC TCGGCAGAAA 180 TGACAATGCT CCCAAAGGCG TGAGCACCCG AAGACAACTA AGCAGGAGAT CGCATGCCGT 240 TTGTGACTAC CCAACCAGAA GCACTGGCGG CGGCGGCCGG CAGTCTGCAG GGAATCGGCT 300 CCGCATTGAA CGCCCAGAAT GCGGCTGCGG CGACTCCCAC GACGGGGGTG GTCCGGCGGC 360 CGCCGATGAA NTGTCGGCGC TGACGGCGGC TCAGTTCGCG GCACACGCCC AGATCTATCA 420 GGCCGTCAGC GCCCAGGCCG CGGCGATTCA CGAGATGTTC GTCAACACTC TACAGATGAG 480 CTCAGGGTCG TATGCTGCTA CCGAGGCCGC CAACGCGGCC GCGGCCGGNT AGAGGAGTCA 540 CTGCGATGGA TTTTGGGGCG TTGCCGCCGG AGGTCAATTC GGTGCGGATG TATGCCGTTC 600 CTGGCTCGGC ACCAATGGTC GCTGCGGCGT CGGCCTGGAA CGGGTTGGCC GCGGAGCTGA 660 GTTCGGCGGC CACCGGTTAT GAGACGGTGA TCACTCAGCT CAGCAGTGAG GGGTGGCTAG 720 GTCCGGCGTC AGCGGCGATG GCCGAGGCAG TTGCGCCGTA TGTGGCGTGG ATGAGTGCCG 780 CTGCGGCGCA AGCCGAGCAG GCGGCCACAC AGGCCAGGGC CGCCGCGGCC GCTTTTGAGG 840 CGGCGTTTGC CGCGACGGTG CCTCCGCCGT TGATCGCGGC CAACCGGGCT TCGTTGATGC 900 AGCTGATCTC GACGAATGTC TTTGGTCAGA ACACCTCGGC GATCGCGGCC GCCGAAGCTC 960 AGTACGG 967 585 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 12 TGGATTCCGA TAGCGGTTTC GGCCCCTCGA CGGGCGACCA CGGCGCGCAG GCCTCCGAAC 60 GGGGGGCCGG GACGCTGGGA TTCGCCGGGA CCGCAACCAA AGAACGCCGG GTCCGGGCGG 120 TCGGGCTGAC CGCACTGGCC GGTGATGAGT TCGGCAACGG CCCCCGGATG CCGATGGTGC 180 CGGGGACCTG GGAGCAGGGC AGCAACGAGC CCGAGGCGCC CGACGGATCG GGGAGAGGGG 240 GAGGCGACGG CTTACCGCAC GACAGCAAGT AACCGAATTC CGAATCACGT GGACCCGTAC 300 GGGTCGAAAG GAGAGATGTT ATGAGCCTTT TGGATGCTCA TATCCCACAG TTGGTGGCCT 360 CCCAGTCGGC GTTTGCCGCC AAGGCGGGGC TGATGCGGCA CACGATCGGT CAGGCCGAGC 420 AGGCGGCGAT GTCGGCTCAG GCGTTTCACC AGGGGGAGTC GTCGGCGGCG TTTCAGGCCG 480 CCCATGCCCG GTTTGTGGCG GCGGCCGCCA AAGTCAACAC CTTGTTGGAT GTCGCGCAGG 540 CGAATCTGGG TGAGGCCGCC GGTACCTATG TGGCCGCCGA TGCTG 585 144 amino acids amino acid single linear peptide Mycobacterium tuberculosis 13 Ala Leu Val Thr Thr Asn Phe Phe Gly Val Asn Thr Ile Pro Ile Ala 1 5 10 15 Leu Asn Glu Ala Asp Tyr Leu Arg Met Trp Ile Gln Ala Ala Thr Val 20 25 30 Met Ser His Tyr Gln Ala Val Ala His Glu Ile Trp Cys Leu His Glu 35 40 45 Xaa Ala Ser Ser Gly Lys Pro Trp Ala Ser Ile Thr Thr Gly Ala Pro 50 55 60 Gly Ser Pro Ala Ser Thr Thr Arg Ser Arg Thr Pro Leu Val Ser Thr 65 70 75 80 Asn Arg Xaa Val Xaa Ala Pro Ile Val Ser Pro Asn His Thr Gly His 85 90 95 Arg Pro Glu Lys Gly Leu Gly Ser Xaa Gln Arg Arg Leu Ser Arg Val 100 105 110 Leu Pro Arg Ile Ile Asp Arg Pro Ala Gly Pro Xaa Gly Pro Pro Leu 115 120 125 Thr Ser Gly Ser His Phe Leu Cys Ser Trp His Gly Tyr Ser Ser Gln 130 135 140 352 amino acids amino acid single linear peptide Mycobacterium tuberculosis 14 His Ala Leu Ala Ala Gln Tyr Thr Glu Ile Ala Thr Glu Leu Ala Ser 1 5 10 15 Val Leu Ala Ala Val Gln Ala Ser Ser Trp Gln Gly Pro Ser Ala Asp 20 25 30 Arg Phe Val Val Ala His Gln Pro Phe Arg Tyr Trp Leu Thr His Ala 35 40 45 Ala Thr Val Ala Thr Ala Ala Ala Ala Ala His Xaa Thr Ala Ala Ala 50 55 60 Gly Tyr Thr Ser Ala Leu Gly Gly Met Pro Thr Leu Ala Glu Leu Ala 65 70 75 80 Ala Asn His Ala Met His Gly Ala Leu Val Thr Thr Asn Phe Phe Gly 85 90 95 Val Asn Thr Ile Pro Ile Ala Leu Asn Glu Ala Asp Tyr Leu Arg Met 100 105 110 Trp Ile Gln Ala Ala Thr Val Met Ser His Tyr Gln Ala Val Ala His 115 120 125 Glu Ser Val Ala Ala Thr Pro Ser Thr Pro Pro Ala Pro Gln Ile Val 130 135 140 Thr Ser Ala Ala Ser Ser Ala Ala Ser Ser Ser Phe Pro Asp Pro Thr 145 150 155 160 Lys Leu Ile Leu Gln Leu Leu Lys Asp Phe Leu Glu Leu Leu Arg Tyr 165 170 175 Leu Ala Val Glu Leu Leu Pro Gly Pro Leu Gly Asp Leu Ile Ala Gln 180 185 190 Val Leu Asp Trp Phe Ile Ser Phe Val Ser Gly Pro Val Phe Thr Phe 195 200 205 Leu Ala Tyr Leu Val Leu Asp Pro Leu Ile Tyr Phe Gly Pro Phe Ala 210 215 220 Pro Leu Thr Ser Pro Val Leu Leu Pro Ala Val Glu Leu Arg Asn Arg 225 230 235 240 Leu Lys Thr Ala Thr Gly Leu Thr Leu Pro Pro Thr Val Ile Phe Asp 245 250 255 His Pro Thr Pro Thr Ala Val Ala Glu Tyr Val Ala Gln Gln Met Ser 260 265 270 Gly Ser Arg Pro Thr Glu Ser Gly Asp Pro Thr Ser Gln Val Val Glu 275 280 285 Pro Ala Arg Ala Glu Phe Gly Thr Ser Ala Val His Gln Ile Pro Pro 290 295 300 Arg Pro Ala Asp Thr Arg Arg Ala Cys Arg His Arg Asp Asp Val Pro 305 310 315 320 Arg Asp Ser Arg Ile Ala Gln His Arg Asp Gly Ala Gly Leu Asp Pro 325 330 335 Thr Glu Arg Gly Thr Ser Glu Gly Asp Gln Gly Leu Val Ser Gly Trp 340 345 350 141 amino acids amino acid single linear peptide Mycobacterium tuberculosis 15 Met Asp Phe Gly Ala Leu Pro Pro Glu Val Asn Ser Val Arg Met Tyr 1 5 10 15 Ala Val Pro Gly Ser Ala Pro Met Val Ala Ala Ala Ser Ala Trp Asn 20 25 30 Gly Leu Ala Ala Glu Leu Ser Ser Ala Ala Thr Gly Tyr Glu Thr Val 35 40 45 Ile Thr Gln Leu Ser Ser Glu Gly Trp Leu Gly Pro Ala Ser Ala Ala 50 55 60 Met Ala Glu Ala Val Ala Pro Tyr Val Ala Trp Met Ser Ala Ala Ala 65 70 75 80 Ala Gln Ala Glu Gln Ala Ala Thr Gln Ala Arg Ala Ala Ala Ala Ala 85 90 95 Phe Glu Ala Ala Phe Ala Ala Thr Val Pro Pro Pro Leu Ile Ala Ala 100 105 110 Asn Arg Ala Ser Leu Met Gln Leu Ile Ser Thr Asn Val Phe Gly Gln 115 120 125 Asn Thr Ser Ala Ile Ala Ala Ala Glu Ala Gln Tyr Gly 130 135 140 58 amino acids amino acid single linear peptide Mycobacterium tuberculosis 16 Met Ala Ser Arg Phe Met Thr Asp Pro His Ala Met Arg Asp Met Ala 1 5 10 15 Gly Arg Phe Glu Val His Ala Gln Thr Val Glu Asp Glu Ala Arg Arg 20 25 30 Met Trp Ala Ser Ala Gln Asn Ile Ser Gly Ala Gly Trp Ser Gly Met 35 40 45 Ala Glu Ala Thr Ser Leu Asp Thr Met Thr 50 55 67 amino acids amino acid single linear peptide Mycobacterium tuberculosis 17 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala Met 1 5 10 15 Ile Arg Ala Gln Ala Ala Ser Leu Glu Ala Glu His Gln Ala Ile Val 20 25 30 Arg Asp Val Leu Ala Ala Gly Asp Phe Trp Gly Gly Ala Gly Ser Val 35 40 45 Ala Cys Gln Glu Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Ile 50 55 60 Tyr Glu Gln 65 58 amino acids amino acid single linear peptide Mycobacterium tuberculosis 18 Met Ala Ser Arg Phe Met Thr Asp Pro His Ala Met Arg Asp Met Ala 1 5 10 15 Gly Arg Phe Glu Val His Ala Gln Thr Val Glu Asp Glu Ala Arg Arg 20 25 30 Met Trp Ala Ser Ala Gln Asn Ile Ser Gly Ala Gly Trp Ser Gly Met 35 40 45 Ala Glu Ala Thr Ser Leu Asp Thr Met Thr 50 55 94 amino acids amino acid single linear peptide Mycobacterium tuberculosis 19 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala Met 1 5 10 15 Ile Arg Ala Gln Ala Ala Ser Leu Glu Ala Glu His Gln Ala Ile Val 20 25 30 Arg Asp Val Leu Ala Ala Gly Asp Phe Trp Gly Gly Ala Gly Ser Val 35 40 45 Ala Cys Gln Glu Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Ile 50 55 60 Tyr Glu Gln Ala Asn Ala His Gly Gln Lys Val Gln Ala Ala Gly Asn 65 70 75 80 Asn Met Ala Gln Thr Asp Ser Ala Val Gly Ser Ser Trp Ala 85 90 30 amino acids amino acid single linear peptide Mycobacterium tuberculosis 20 Asn Met Leu His Gly Val Arg Asp Gly Leu Val Arg Asp Ala Asn Asn 1 5 10 15 Tyr Glu Gln Gln Glu Gln Ala Ser Gln Gln Ile Leu Ser Ser 20 25 30 94 amino acids amino acid single linear peptide Mycobacterium tuberculosis 21 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala Met 1 5 10 15 Ile Arg Ala Gln Ala Gly Leu Leu Glu Ala Glu His Gln Ala Ile Ile 20 25 30 Arg Asp Val Leu Thr Ala Ser Asp Phe Trp Gly Gly Ala Gly Ser Ala 35 40 45 Ala Cys Gln Gly Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Ile 50 55 60 Tyr Glu Gln Ala Asn Ala His Gly Gln Lys Val Gln Ala Ala Gly Asn 65 70 75 80 Asn Met Ala Gln Thr Asp Ser Ala Val Gly Ser Ser Trp Ala 85 90 69 amino acids amino acid single linear peptide Mycobacterium tuberculosis 22 Ala Arg Arg Met Trp Ala Ser Ala Gln Asn Ile Ser Gly Ala Gly Trp 1 5 10 15 Ser Gly Met Ala Glu Ala Thr Ser Leu Asp Thr Met Ala Gln Met Asn 20 25 30 Gln Ala Phe Arg Asn Ile Val Asn Met Leu His Gly Val Arg Asp Gly 35 40 45 Leu Val Arg Asp Ala Asn Asn Tyr Glu Gln Gln Glu Gln Ala Ser Gln 50 55 60 Gln Ile Leu Ser Ser 65 94 amino acids amino acid single linear peptide Mycobacterium tuberculosis 23 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala Met 1 5 10 15 Ile Arg Ala Gln Ala Gly Leu Leu Glu Ala Glu His Gln Ala Ile Ile 20 25 30 Arg Asp Val Leu Thr Ala Ser Asp Phe Trp Gly Gly Ala Gly Ser Ala 35 40 45 Ala Cys Gln Gly Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Ile 50 55 60 Tyr Glu Gln Ala Asn Thr His Gly Gln Lys Val Gln Ala Ala Gly Asn 65 70 75 80 Asn Met Ala Gln Thr Asp Ser Ala Val Xaa Ser Ser Trp Ala 85 90 52 amino acids amino acid single linear peptide Mycobacterium tuberculosis 24 Gly Met Ala Glu Ala Thr Ser Xaa Asp Thr Met Thr Gln Met Asn Gln 1 5 10 15 Ala Phe Arg Asn Ile Val Asn Met Leu His Gly Val Arg Asp Gly Leu 20 25 30 Val Arg Asp Ala Asn Xaa Tyr Glu Gln Gln Glu Gln Ala Ser Gln Gln 35 40 45 Ile Leu Ser Ser 50 94 amino acids amino acid single linear peptide Mycobacterium tuberculosis 25 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala Met 1 5 10 15 Ile Arg Ala Gln Ala Gly Ser Leu Glu Ala Glu His Gln Ala Ile Ile 20 25 30 Ser Asp Val Leu Thr Ala Ser Asp Phe Trp Gly Gly Ala Gly Ser Ala 35 40 45 Ala Cys Gln Gly Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Xaa 50 55 60 Tyr Glu Gln Ala Asn Ala His Gly Gln Lys Val Gln Ala Ala Gly Asn 65 70 75 80 Asn Met Ala Gln Thr Asp Ser Ala Val Gly Ser Ser Trp Ala 85 90 98 amino acids amino acid single linear peptide Mycobacterium tuberculosis 26 Met Thr Ser Arg Phe Met Thr Asp Pro His Ala Met Arg Asp Met Ala 1 5 10 15 Gly Arg Phe Glu Val His Ala Gln Thr Val Glu Asp Glu Ala Arg Arg 20 25 30 Met Trp Ala Ser Ala Gln Asn Ile Ser Gly Ala Gly Trp Ser Gly Met 35 40 45 Ala Glu Ala Thr Ser Leu Asp Thr Met Ala Gln Met Asn Gln Ala Phe 50 55 60 Arg Asn Ile Val Asn Met Leu His Gly Val Arg Asp Gly Leu Val Arg 65 70 75 80 Asp Ala Asn Asn Tyr Glu Gln Gln Glu Gln Ala Ser Gln Gln Ile Leu 85 90 95 Ser Ser 94 amino acids amino acid single linear peptide Mycobacterium tuberculosis 27 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala Met 1 5 10 15 Ile Arg Ala Xaa Ala Gly Leu Leu Glu Ala Glu His Gln Ala Ile Ile 20 25 30 Ser Asp Val Leu Thr Ala Ser Asp Phe Trp Gly Gly Ala Gly Ser Ala 35 40 45 Ala Cys Gln Gly Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Ile 50 55 60 Tyr Glu Gln Ala Asn Ala His Gly Gln Lys Val Gln Ala Ala Gly Asn 65 70 75 80 Asn Met Ala Gln Thr Asp Ser Ala Val Gly Ser Ser Trp Ala 85 90 81 amino acids amino acid single linear peptide Mycobacterium tuberculosis 28 Arg Phe Glu Val His Ala Gln Thr Val Glu Asp Glu Ala Arg Arg Met 1 5 10 15 Trp Ala Ser Ala Gln Asn Ile Ser Gly Ala Gly Trp Ser Gly Met Ala 20 25 30 Xaa Ala Thr Ser Leu Asp Thr Met Ala Gln Met Asn Gln Ala Phe Arg 35 40 45 Asn Ile Val Asn Met Leu His Gly Val Arg Asp Gly Leu Val Arg Asp 50 55 60 Ala Asn Asn Tyr Glu Gln Gln Glu Gln Ala Ser Gln Gln Ile Leu Ser 65 70 75 80 Ser 94 amino acids amino acid single linear peptide Mycobacterium tuberculosis 29 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala Met 1 5 10 15 Ile Arg Ala Leu Ala Gly Leu Leu Glu Ala Glu His Gln Ala Ile Ile 20 25 30 Ser Asp Val Leu Thr Ala Ser Asp Phe Trp Gly Gly Ala Gly Ser Ala 35 40 45 Ala Cys Gln Gly Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Ile 50 55 60 Tyr Glu Gln Ala Asn Ala His Gly Gln Lys Val Gln Ala Ala Gly Asn 65 70 75 80 Asn Met Ala Gln Thr Asp Ser Ala Val Gly Ser Ser Trp Ala 85 90 11 amino acids amino acid single linear peptide Mycobacterium tuberculosis 30 Gln Glu Gln Ala Ser Gln Gln Ile Leu Ser Ser 1 5 10 94 amino acids amino acid single linear peptide Mycobacterium tuberculosis 31 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala Met 1 5 10 15 Ile Arg Ala Gln Ala Gly Leu Leu Glu Ala Glu His Gln Ala Ile Ile 20 25 30 Arg Asp Val Leu Thr Ala Ser Asp Phe Trp Gly Gly Ala Gly Ser Ala 35 40 45 Ala Cys Gln Gly Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Ile 50 55 60 Tyr Glu Gln Ala Asn Ala His Gly Gln Lys Val Gln Ala Ala Gly Asn 65 70 75 80 Asn Met Ala Gln Thr Asp Ser Ala Val Gly Ser Ser Trp Ala 85 90 99 amino acids amino acid single linear peptide Mycobacterium tuberculosis 32 Met Ser Phe Val Thr Thr Gln Pro Glu Ala Leu Ala Ala Ala Ala Ala 1 5 10 15 Asn Leu Gln Gly Ile Gly Thr Thr Met Asn Ala Gln Asn Ala Ala Ala 20 25 30 Ala Ala Pro Thr Thr Gly Val Val Pro Ala Ala Ala Asp Glu Val Ser 35 40 45 Ala Leu Thr Ala Ala Gln Phe Ala Ala His Ala Gln Met Tyr Gln Thr 50 55 60 Val Ser Ala Gln Ala Ala Ala Ile His Glu Met Phe Val Asn Thr Leu 65 70 75 80 Val Ala Ser Ser Gly Ser Tyr Ala Ala Thr Glu Ala Ala Asn Ala Ala 85 90 95 Ala Ala Gly 99 amino acids amino acid single linear peptide Mycobacterium tuberculosis 33 Met Ser Phe Val Thr Thr Gln Pro Glu Ala Leu Ala Ala Ala Ala Ala 1 5 10 15 Asn Leu Gln Gly Ile Gly Thr Thr Met Asn Ala Gln Asn Ala Ala Ala 20 25 30 Ala Ala Pro Thr Thr Gly Val Val Pro Ala Ala Ala Asp Glu Val Ser 35 40 45 Ala Leu Thr Ala Ala Gln Phe Ala Ala His Ala Gln Met Tyr Gln Thr 50 55 60 Val Ser Ala Gln Ala Ala Ala Ile His Glu Met Phe Val Asn Thr Leu 65 70 75 80 Val Ala Ser Ser Gly Ser Tyr Ala Ala Thr Glu Ala Ala Asn Ala Ala 85 90 95 Ala Ala Gly 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 34 Asp Pro His Ala Met Arg Asp Met Ala Gly Arg Phe Glu Val His 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 35 Arg Asp Met Ala Gly Arg Phe Glu Val His Ala Gln Thr Val Glu 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 36 Arg Phe Glu Val His Ala Gln Thr Val Glu Asp Glu Ala Arg Arg 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 37 Ala Gln Thr Val Glu Asp Glu Ala Arg Arg Met Trp Ala Ser Ala 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 38 Asp Glu Ala Arg Arg Met Trp Ala Ser Ala Gln Asn Ile Ser Gly 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 39 Met Trp Ala Ser Ala Gln Asn Ile Ser Gly Ala Gly Trp Ser Gly 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 40 Gln Asn Ile Ser Gly Ala Gly Trp Ser Gly Met Ala Glu Ala Thr 1 5 10 15 16 amino acids amino acid single linear peptide Mycobacterium tuberculosis 41 Ala Gly Trp Ser Gly Met Ala Glu Ala Thr Ser Leu Asp Thr Met Thr 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 42 Met Ala Glu Ala Thr Ser Leu Asp Thr Met Ala Gln Met Asn Gln 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 43 Ser Leu Asp Thr Met Ala Gln Met Asn Gln Ala Phe Arg Asn Ile 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 44 Ala Gln Met Asn Gln Ala Phe Arg Asn Ile Val Asn Met Leu His 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 45 Ala Phe Arg Asn Ile Val Asn Met Leu His Gly Val Arg Asp Gly 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 46 Val Asn Met Leu His Gly Val Arg Asp Gly Leu Val Arg Asp Ala 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 47 Gly Val Arg Asp Gly Leu Val Arg Asp Ala Asn Asn Tyr Glu Gln 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 48 Leu Val Arg Asp Ala Asn Asn Tyr Glu Gln Gln Glu Gln Ala Ser 1 5 10 15 16 amino acids amino acid single linear peptide Mycobacterium tuberculosis 49 Asn Asn Tyr Glu Gln Gln Glu Gln Ala Ser Gln Gln Ile Leu Ser Ser 1 5 10 15 17 amino acids amino acid single linear peptide Mycobacterium tuberculosis 50 Met Ala Ser Arg Phe Met Thr Asp Pro His Ala Met Arg Asp Met Ala 1 5 10 15 Gly 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 51 Met Thr Ile Asn Tyr Gln Phe Gly Asp Val Asp Ala His Gly Ala 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 52 Gln Phe Gly Asp Val Asp Ala His Gly Ala Met Ile Arg Ala Gln 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 53 Asp Ala His Gly Ala Met Ile Arg Ala Gln Ala Ala Ser Leu Glu 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 54 Met Ile Arg Ala Gln Ala Ala Ser Leu Glu Ala Glu His Gln Ala 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 55 Ala Ala Ser Leu Glu Ala Glu His Gln Ala Ile Val Arg Asp Val 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 56 Ala Glu His Gln Ala Ile Val Arg Asp Val Leu Ala Ala Gly Asp 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 57 Ile Val Arg Asp Val Leu Ala Ala Gly Asp Phe Trp Gly Gly Ala 1 5 10 15 16 amino acids amino acid single linear peptide Mycobacterium tuberculosis 58 Leu Ala Ala Gly Asp Phe Trp Gly Gly Ala Gly Ser Val Ala Cys Gln 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 59 Phe Trp Gly Gly Ala Gly Ser Val Ala Cys Gln Glu Phe Ile Thr 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 60 Gly Ser Val Ala Cys Gln Glu Phe Ile Thr Gln Leu Gly Arg Asn 1 5 10 15 18 amino acids amino acid single linear peptide Mycobacterium tuberculosis 61 Gln Glu Phe Ile Thr Gln Leu Gly Arg Asn Phe Gln Val Ile Tyr Glu 1 5 10 15 Gln Ala 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 62 Arg Asn Phe Gln Val Ile Tyr Glu Gln Ala Asn Ala His Gly Gln 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 63 Ile Tyr Glu Gln Ala Asn Ala His Gly Gln Lys Val Gln Ala Ala 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 64 Asn Ala His Gly Gln Lys Val Gln Ala Ala Gly Asn Asn Met Ala 1 5 10 15 15 amino acids amino acid single linear peptide Mycobacterium tuberculosis 65 Lys Val Gln Ala Ala Gly Asn Asn Met Ala Gln Thr Asp Ser Ala 1 5 10 15 16 amino acids amino acid single linear peptide Mycobacterium tuberculosis 66 Gly Asn Asn Met Ala Gln Thr Asp Ser Ala Val Gly Ser Ser Trp Ala 1 5 10 15 1617 base pairs nucleic acid double linear DNA (genomic) Mycobacterium tuberculosis 67 CATCGGAGGG AGTGATCACC ATGCTGTGGC ACGCAATGCC ACCGGAGNTA AATACCGCAC 60 GGCTGATGGC CGGCGCGGGT CCGGCTCCAA TGCTTGCGGC GGCCGCGGGA TGGCAGACGC 120 TTTCGGCGGC TCTGGACGCT CAGGCCGTCG AGTTGACCGC GCGCCTGAAC TCTCTGGGAG 180 AAGCCTGGAC TGGAGGTGGC AGCGACAAGG CGCTTGCGGC TGCAACGCCG ATGGTGGTCT 240 GGCTACAAAC CGCGTCAACA CAGGCCAAGA CCCGTGCGAT GCAGGCGACG GCGCAAGCCG 300 CGGCATACAC CCAGGCCATG GCCACGACGC CGTCGCTGCC GGAGATCGCC GCCAACCACA 360 TCACCCAGGC CGTCCTTACG GCCACCAACT TCTTCGGTAT CAACACGATC CCGATCGCGT 420 TGACCGAGAT GGATTATTTC ATCCGTATGT GGAACCAGGC AGCCCTGGCA ATGGAGGTCT 480 ACCAGGCCGA GACCGCGGTT AACACGCTTT TCGAGAAGCT CGAGCCGATG GCGTCGATCC 540 TTGATCCCGG CGCGAGCCAG AGCACGACGA ACCCGATCTT CGGAATGCCC TCCCCTGGCA 600 GCTCAACACC GGTTGGCCAG TTGCCGCCGG CGGCTACCCA GACCCTCGGC CAACTGGGTG 660 AGATGAGCGG CCCGATGCAG CAGCTGACCC AGCCGCTGCA GCAGGTGACG TCGTTGTTCA 720 GCCAGGTGGG CGGCACCGGC GGCGGCAACC CAGCCGACGA GGAAGCCGCG CAGATGGGCC 780 TGCTCGGCAC CAGTCCGCTG TCGAACCATC CGCTGGCTGG TGGATCAGGC CCCAGCGCGG 840 GCGCGGGCCT GCTGCGCGCG GAGTCGCTAC CTGGCGCAGG TGGGTCGTTG ACCCGCACGC 900 CGCTGATGTC TCAGCTGATC GAAAAGCCGG TTGCCCCCTC GGTGATGCCG GCGGCTGCTG 960 CCGGATCGTC GGCGACGGGT GGCGCCGCTC CGGTGGGTGC GGGAGCGATG GGCCAGGGTG 1020 CGCAATCCGG CGGCTCCACC AGGCCGGGTC TGGTCGCGCC GGCACCGCTC GCGCAGGAGC 1080 GTGAAGAAGA CGACGAGGAC GACTGGGACG AAGAGGACGA CTGGTGAGCT CCCGTAATGA 1140 CAACAGACTT CCCGGCCACC CGGGCCGGAA GACTTGCCAA CATTTTGGCG AGGAAGGTAA 1200 AGAGAGAAAG TAGTCCAGCA TGGCAGAGAT GAAGACCGAT GCCGCTACCC TCGCGCAGGA 1260 GGCAGGTAAT TTCGAGCGGA TCTCCGGCGA CCTGAAAACC CAGATCGACC AGGTGGAGTC 1320 GACGGCAGGT TCGTTGCAGG GCCAGTGGCG CGGCGCGGCG GGGACGGCCG CCCAGGCCGC 1380 GGTGGTGCGC TTCCAAGAAG CAGCCAATAA GCAGAAGCAG GAACTCGACG AGATCTCGAC 1440 GAATATTCGT CAGGCCGGCG TCCAATACTC GAGGGCCGAC GAGGAGCAGC AGCAGGCGCT 1500 GTCCTCGCAA ATGGGCTTCT GACCCGCTAA TACGAAAAGA AACGGAGCAA AAACATGACA 1560 GAGCAGCAGT GGAATTTCGC GGGTATCGAG GCCGCGGCAA GCGCAATCCA GGGAAAT 1617 25 amino acids amino acid single linear peptide Mycobacterium tuberculosis 68 Val Thr Thr Asn Phe Phe Gly Val Asn Thr Ile Pro Ile Ala Leu Asn 1 5 10 15 Glu Ala Asp Tyr Leu Arg Met Trp Ile 20 25 25 amino acids amino acid single linear peptide Mycobacterium tuberculosis 69 Asn Glu Ala Asp Tyr Leu Arg Met Trp Ile Gln Ala Ala Thr Val Met 1 5 10 15 Ser His Tyr Gln Ala Val Ala His Glu 20 25 

1. A method for identifying DNA sequences that encode CD4+ T cell stimulating antigens, comprising: a) transforming a host cell with a plasmid suspected of containing a DNA sequence that encodes a CD4+ T cell stimulating antigen and inducing antigen expression; b) incubating the transformed host cell with at least one dendritic cell for a period of time sufficient to form an MHC II/peptide complex on the dendritic cell, wherein the peptide is derived from the expressed antigen; c) incubating the dendritic cell with CD4+ T cells and determining the level of CD4+ T cell stimulation, thereby identifying a transformed host cell that expresses at least one CD4+ T cell stimulating antigen; and d) isolating DNA from the transformed host cell.
 2. The method of claim 1 wherein the level of CD4+ T cell stimulation is determined by measuring production of at least one cytokine.
 3. The method of claim 2 wherein the cytokine is interferon-gamma.
 4. The method of claim 1 wherein the level of CD4+ T cell stimulation is determined by measuring cell proliferation.
 5. The method of claim 1 wherein the plasmid contains DNA isolated from an infectious disease agent.
 6. The method of claim 5 wherein the infectious disease agent is M. tuberculosis.
 7. The method of claim 1 wherein the plasmid contains DNA isolated from tumor tissue.
 8. The method of claim 1 wherein the host cell is selected from the group consisting of E. coli, yeast and mammalian cells.
 9. A method for identifying DNA sequences that encode CD4+ T cell stimulating antigens, comprising: a) transforming a host cell with a plasmid suspected of containing a DNA sequence that encodes a CD4+ T cell stimulating antigen and inducing antigen expression; b) incubating the transformed host cell with at least one dendritic cell for a period of time sufficient to form an MHC II/peptide complex on the dendritic cell, wherein the peptide is derived from the expressed antigen; and c) incubating the dendritic cell with CD4+ T cells and determining the level of CD4+ T cell stimulation, thereby identifying a transformed host cell that expresses at least one CD4+ T cell stimulating antigen.
 10. The method of claim 9 additionally comprising isolating DNA from the transformed host cell.
 11. An isolated DNA sequence selected from the group consisting of: a) DNA sequences isolated using the method of claim 1; b) DNA sequences complementary to DNA sequences of a); and c) DNA sequences that hybridize to a DNA sequence of a) or b) under moderately stringent conditions.
 12. An expression vector comprising a DNA molecule according to claim
 11. 13. A host cell transformed with an expression vector according to claim
 12. 14. A polypeptide comprising an immunogenic portion of an antigen, or a variant of said antigen that differs only in conservative substitutions and/or modifications, the antigen comprising an amino acid sequence encoded by a DNA sequence according to claim
 11. 15. A method for identifying DNA sequences that encode antigens comprising antibody epitopes, the method comprising: a) transforming a host cell with a plasmid suspected of containing a DNA sequence that encodes a CD4+ T cell stimulating antigen and inducing antigen expression; b) incubating the transformed host cell with at least one dendritic cell for a period of time sufficient to form an MHC II/peptide complex on the dendritic cell, wherein the peptide is derived from the expressed antigen; c) incubating the dendritic cell with CD4+ T cells and determining the level of CD4+ T cell stimulation, thereby identifying a transformed host cell that expresses at least one CD4+ T cell stimulating antigen; and d) isolating DNA from the transformed host cell.
 16. The method of claim 51 wherein the level of CD4+ T cell stimulation is determined by measuring production of at least one cytokine.
 17. The method of claim 16 wherein the cytokine is interferon-gamma.
 18. The method of claim 15 wherein the level of CD4+ T cell stimulation is determined by measuring cell proliferation.
 19. The method of claim 15 wherein the plasmid contains DNA isolated from an infectious disease agent.
 20. The method of claim 19 wherein the infectious disease agent is M. tuberculosis. 